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AN INTEGRATED NEUROLINGUISTIC THEORY OF BILINGUALISM (1976-2000) 


Michel Paradis 
McGill University 


introduction. My very first paper published on bilingualism appeared in the Third 
lacus Forum (Paradis 1977). It was called'The stratification of bilingualism’. It contained 
the germ of the Three-Store Hypothesis and of the Subset Hypothesis, though these 
did not clearly occur to me as such until the Sixth and Seventh lacus Forum, respec¬ 
tively. The main point of that first paper, presented at the meeting in El Paso in 1976, 
was that each level of grammar of each of a bilingual’s languages could independently 
contain various degrees of unidirectional or bidirectional interference, in keeping with 
Weinreich’s (1953) coordinate, compound and subordinate organization. This paper also 
contained the seed of the idea, made more explicit later, that the contents of the repre¬ 
sentation (i.e., whether it contains illegitimate features of another language or not) are 
orthogonal to the representation of two languages in one brain. It dealt with the linguis¬ 
tic level of description—a description of the grammar of a given bilingual individual 
at a given time (Figure 1, overleaf). There were two main points: (1) the language 
system of a bilingual individual may contain, in various proportions at each stratal level 
(roughly corresponding to phonology, morphology, syntax and lexical semantics), all 
three types of interference (i.e., unidirectional, bidirectional, or none); (2) the state of 
bilingualism is not static, as the various interlingual relationships will change, over time, 
possibly differently at each stratum, as a result of experience. 

The Three-Store Hypothesis, as it was initially proposed in 1979 at the sixth lacus 
Forum in Calgary, was a psycholinguistic construct, separating the nonlinguistic cog¬ 
nitive store from the two language stores of bilingual speakers. It was meant to rec¬ 
oncile the contradictory findings of experimental psycholinguists of the 1960s and 
1970s who found evidence, now in support of one store, now of two stores (Figure 
2a, overleaf). Kolers (1968) reported that his subjects’ responses were neither fully 
compatible with the one-store nor with the two-store account, but rather were simi¬ 
lar to responses to synonyms in the same language. This led to the proposal of the 
Three-Store Hypothesis (Figure 2b, overleaf). The Subsystems Hypothesis, first pre¬ 
sented here at Rice University in 1980, and published in the Seventh lacus Forum 
(Paradis 1981), was a neurofunctional proposal, compatible with all the various recov¬ 
ery patterns of bilingual aphasic patients, including alternating antagonism and selec¬ 
tive aphasia, which were described for the first time some years later (Paradis et al. 
1982; Paradis & Goldblum 1989; Nilipour & Ashayeri 1989). It too assumes a separate 
neurofunctional cognitive system independent of the language system, and hence 
of the language systems of bilinguals. In addition, it assumes that language is an 
independent neurofunctional system, a neurofunctional module, receiving inputs 
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Figure 1 . The left shows the linguistic system of a predominately compund bilingual, while the 
right shows the linguistic system of a predominately subordinate bilingual. Both halves show oral 
encoding and aural decoding only. 

from the cognitive systems and providing outputs to the articulatory or digitomanual 
kinetic systems. Each language is a subset of the larger language neurofunctional 
system (Figures 3 to 9). 


1. laterality. In the 1970s and 1980s, experimental psycholinguists were exploring 
the possibility of less asymmetrical cerebral representation of one or both languages 
of bilingual speakers. Experimental findings continued to be contradictory in spite of 
the ever-shrinking relevant population, while there was no clinical evidence whatso¬ 
ever in favor of any difference of lateralization of language in any type of bilinguals 
(Paradis 1990,1992b, 1995). This led me to search for what the increased contribution 
of the right hemisphere could be, since it obviously was not subserving grammar. 


2 . the integration of pragmatics. After considering all possibilities (Paradis, 
1987), it appeared that the right hemisphere specialized in processing the pragmatic 
aspects of language use, which turn out to be at least as important as sentence gram¬ 
mar or the language system per se. The evidence comes from right-hemisphere-dam- 
aged patients with dyshyponoia (Paradis 1998). This adds a new component to the 
model, given that linguistic pragmatic features are language specific. There is thus 
another dimension to the possibility of unidirectional or bidirectional interference, 
namely the interpretation of L2 pragmatic clues with their li value, hence resulting 
in misunderstandings and faux pas. What is a polite form of request in one language 
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Figure 2 a. Two hypotheses about the way a bilingual person handles information are represented 
by two arrangements of tanks. One hypothesis (left) is that all his information is stored centrally, 
or in one tank, and that he has access to it equally with both languages, which are represented by 
the various taps. The other (right) is that his information is stored in linguistically associated ways, 
or in separate tanks. Experiments by the author indicated that the actual situation of a bilingual 
person combines parts of both hypotheses. (Kolers 1968 ) 


Figure 2 b. The actual situation of a 
bilingual person may be illustrated by 
a third hypothesis: both languages are 
differentially connected to the same con¬ 
ceptual-experiential information store 
(Paradis 1978 ). 
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L 



Figure 3 . Schematic representation of the lan¬ 
guage system (L) comprising 2 subsystems. Left 
of the vertical bar, English (E); right of the 
vertical bar, Japanese (J). 


L 



Figure 5 . The languages of a bilingual speaker, 
each a subsystem within the various language 
neurofunctional modules. Modules: P, S, L; 
submodules: EP, JP, ES, JS, EL, JL. 



L 



Figure 4 . Language as a collection of modules 
(neurofunctional subsystems). Only three mod¬ 
ules are represented for the sake of simplicity: 
phonology (P), syntax (S) and lexical seman¬ 
tics (L). 

L 



Figure 6. One language selectively inhibited. 


L 



Figure 7 . One module (here syntax) selectively Figure 8 . Selective inhibition of English pho- 
inhibited. nology. 


L 



Figure 9 . English phonology and Japanese syntax do not form a natural 
subsystem. Even though the internal structure ofES is independent of 
the internal structure ofofJS, they both instantiate parameters ofS (as 
opposed to parameters of P, for instance), which makes ES functionally 
more similar in nature to JS than to EP or JP. A cooccurence of lesions 
affecting at the same time EP and JS may not be altogether impossible, 
but it is very improbable, given the different loci subserving thephysiol- 
ogy of each neurofunctional submodule. It would take two different 
lesions (one affecting English phonology, the other Japanese syntax) to 
give rise to different aphasia, namely, qualitatively different symptoms in 
each language. 
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may be downright rude in another. ‘I like your tie’ is a compliment in English, but an 
indirect request for the tie in Chinese or Arabic. ‘Why don’t you sit down?’ is a polite 
invitation in English, but a rude question in Korean. ‘Have a cookie’ is a nice thing to 
say to a guest in English, but considered bossy and hence inappropriate in Navaho. 
Thus, not only do idiomatic expressions and proverbial phrases have different inter¬ 
pretations, but plain indirect speech acts do too. Not to mention differences in body 
language, facial expressions and prosody. What counts as a possible indirect speech 
act, what context determines the use of this or that form of address, conventional 
metaphors, conventional gestures and facial expressions, all vary in accordance with 
each language and culture. Thus, in a neurofunctional model of bilingualism, prag¬ 
matics also needs to be integrated. Each level of linguistic structure (prosodic, (seg¬ 
mental) phonological, morphological, syntactic and lexical-semantic) is affected by 
pragmatics during the encoding and decoding of utterances. The listener derives 
meaning from prosody, the selection of specific words (e.g., tu/vous, in French), syn¬ 
tactic constructions (e.g., passive vs. active; topicalized subject or object). A passive 
construction places emphasis on the patient rather than the agent of the verb and 
may be reinforced or contradicted by stress at the phonological level, in addition to 
prosody, facial expression, conventional gestures and the like. The use of tu, which 
will, in one context be interpreted as a sign of friendliness or intimacy, will, by the 
same hearer in another context, be interpreted as a sign of contempt. There is no avail¬ 
able empirical evidence as to whether a bilingual individual can lose access to the 
linguistic pragmatics of only one language. There is no evidence that would help us 
decide whether linguistic pragmatics is stored as one extended system or whether, as 
for the linguistic systems, the pragmatic features associated with each language con¬ 
stitute a subsystem of pragmatics. Our hypothesis is that they do, but this will have to 
be empirically verified. 

Because linguistic pragmatic inference/implicature procedures are specific to each 
language, we may assume that they form a natural class and that, as such, each is likely 
to form a subsystem connected to its respective language (subsystem). The schematic 
representation in Figure 10 (overleaf) is an attempt to visually represent the theoreti¬ 
cal construct referred to above. 

Pragmatics is an important dimension of bilingualism. Lack of pragmatic knowl¬ 
edge of L2 leads to communication breakdowns at least as important as lack of lan¬ 
guage (i.e., grammar) proficiency. In the same way that individuals with dyshyponoia 
encounter social problems, ignorance of L2 pragmatics (especially for speakers with 
good linguistic skills) is similarly disruptive. Speakers of a language as a second lan¬ 
guage may understand what native L2 speakers say, but not what they mean. In turn, 
they may use inappropriate speech acts or other pragmatic features and thus be mis¬ 
understood. 

3. conscious and unconscious processing. For the past century and a half, lin¬ 
guists and aphasiologists have used the term ‘language’ to mean the language system, 
the code, the grammar (i.e., phonology, morphology, syntax and semantics), which 
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Figure 10. Visual representation of theoretical constructs described in this paper. 

has come to be referred to as implicit linguistic competence. Language was thus 
assumed to be represented in circumscribed areas of the perisylvian region of the left 
hemisphere, since damage to these regions would result in phonological, morphologi¬ 
cal, syntactic or semantic deficits. Research on second language acquisition pointed 
to a double dissociation between the ability to do well on off-line metalinguistic tasks 
(say, on a school test) and on-line accuracy and fluency (say, in normal conversation). 
This and the double dissociation between aphasia and amnesia drew attention to pro¬ 
cedural memory, which underlies linguistic competence, and declarative memory, 
which underlies metalinguistic knowledge (Paradis 1994). Implicit linguistic compe¬ 
tence is acquired implicitly, that is, by focusing attention on something other than 
what is internalized: children focus on meaning and internalize an underlying syn¬ 
tactic structure that is not there to be observed; they acquire new sounds by focusing 









AN INTEGRATED NEUROLINGUISTIC THEORY OF BILINGUALISM (1976-2OOO) 


11 


on the acoustic properties of the input and end up internalizing the proprioceptive 
mechanisms that allow them to articulate those sounds, while remaining forever 
ignorant of which muscle does what, whether the vocal folds vibrate, what the vowel 
onset time is, and whether their tongue touches the side of their cheeks or the back of 
the palate. Competence is stored implicitly and remains opaque to introspection. It is 
used automatically, that is, without conscious control. Metalinguistic knowledge, on 
the other hand, is learned consciously, stored explicitly (and hence can be recounted), 
and used in a controlled manner. Competence and knowledge are subserved by dif¬ 
ferent cerebral mechanisms that reside in anatomically different loci. As a result, each 
is vulnerable to different physiological insult: some bilingual aphasic patients with 
reduced implicit linguistic competence but available metalinguistic knowledge para¬ 
doxically recover a previously weaker L2 over their li (Aglioti & Fabbro 1993). Con¬ 
versely, loss of L2 has been reported in Alzheimer’s Disease, a condition in which 
declarative memory is first affected. (Hyltenstam & Stroud 1993; Obler 1999). 

4. motivation. The extreme variability in second language learners’ success has 
drawn attention to the important dimension of motivation and its cerebral counter¬ 
part: the emotive dopaminergic structures and the amygdaloid complex within the 
limbic system (Schumann 1998). One of the major differences between acquiring a 
first language and learning a second is that children are highly motivated to under¬ 
stand speech directed at them and to make themselves understood: they speak 
because they want to convey a message. This first phase of the microgenesis of any 
utterance is missing in the classroom context where students do not ask‘Where is the 
pharmacy?’ because they have a headache and want to buy an analgesic, but because 
it is their turn to ask the question that corresponds to the picture on top of page 28 
(Paradis 1992a). The reduction in the release of dopamine at the onset of each utter¬ 
ance during the period of appropriation of the language may be an important factor 
in determining the extent of linguistic competence. Motivation has also been shown 
to be relevant in differential performance in aphasia. Individuals with global aphasia 
who are not able to put two words together under normal circumstances may cry 
out‘Shut the goddam door!’when annoyed at the noise or cold or wind coming from 
the outside. 

5. the activation threshold. Another component of the model, which is not asso¬ 
ciated with any particular anatomical area or functional system, but which is physi¬ 
ological in nature, and associated with all higher cognitive representations, whatever 
their domain, is the Activation Threshold Hypothesis. It proposes that an item is acti¬ 
vated when a sufficient amount of positive impulses have reached it. The amount nec¬ 
essary to activate the item constitutes its activation threshold. Every time the item 
is activated, its threshold is lowered and fewer impulses are required to reactivate it. 
Thus, after each activation, the threshold is lowered—but it gradually rises again. If 
the item is not stimulated, it becomes more and more difficult to activate. Attrition 
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is the result of long-term lack of stimulation. Motivation may have the same effect as 
repeated practice in lowering the activation threshold. 

Pathology (or aging) may disrupt the normal activation levels, causing, for exam¬ 
ple, word-finding difficulty. Aphasia would correspond to the blanket raising of the 
threshold for a system or subsystem or module, thus selectively or differentially affect¬ 
ing either the entire language system, or one language, or only the phonology, syntax, 
or lexical access in one language (see Figures 3 to 9). The hypothesis may account for 
recency and frequency effects, and priming phenomena, as well as language attrition. 

6. neurofunctional modularity. While the brain is one, and eventually everything 
is more or less indirectly connected to everything else, some portions of the network 
are dedicated to specific functions and, as such, are part of different neurofunctional 
modules. Each module can be selectively impaired by pathology. In the normal use 
of verbal communication, some modules process information in parallel, while the 
output of certain ones serves as input to others. Episodic and semantic memory, 
encyclopaedic knowledge, and information from various contexts serve as the basis 
for formulating messages which are optionally verbalized, involving at least four inde¬ 
pendent but collaborating neurofunctional systems (implicit linguistic competence, 
metalinguistic knowledge, pragmatics and motivation). The extent of reliance on each 
of these four systems will vary in the bilingual speaker’s two languages to the extent 
that proficiency in the second language differs from that in the native language. While 
all speakers will primarily rely on implicit linguistic competence to communicate 
automatically in their native language, they will differ among themselves with respect 
to their degree of proficiency in L2 and will compensate for the gaps in their linguistic 
competence by greater but differential involvement of pragmatics and metalinguistic 
knowledge. 

Thus the model presented at the third lacus Forum 24 years ago has grown in 
complexity and neurolinguistic relevance but remains valid at the linguistic level of 
analysis. The subsequent Three-Store and Subsystems Hypotheses, which got their 
world premiere presentations at subsequent lacus Forums, have been incorporated 
into the model, which has now been adopted by and large as the standard working 
hypothesis in the psycholinguistics and neurolinguistics of bilingualism (see, e.g., De 
Bot & Schreuder 1993; Green 1998; Grosjean 1998; Malmkiasr 6 - Anderson 1991; Pav¬ 
lenko 1999; Poulisse 1997). The linguistic processes may be sequential (albeit in rapid 
succession, in milliseconds) or run in parallel. Likewise, during decoding (the inter¬ 
pretation of language input), the pragmatic elements may be processed in succession 
(i.e., before or after linguistic processing) or in parallel, inferencing from each linguis¬ 
tic component (prosody, word selection, syntactic construction). Encoding may start 
with pragmatic considerations that guide the selection of linguistic elements, or pro¬ 
ceed in parallel, influencing the selection of elements from each level in turn, sequen¬ 
tially. In speaking, activation of the concepts must precede that of the pragmatic and 
linguistic choices selected to encode the (conceptual) message verbally. 
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Figure 11. Schematic representation of the current integrated neurolinguistic theory 
of bilingualism. 

Figure n, I guess, is what a schematic representation of the current integrated 
neurolinguistic theory of bilingualism could look like. 

conclusion. The moral of the story is that in a brain where virtually everything is 
connected to everything else, there are nevertheless portions of the network that are 
dedicated to the processing of specific functions. While their basic functioning prin¬ 
ciples are essentially the same throughout the brain, each dedicated portion of the 
network subserves its own set of representations, the internal structure of which is 
independent of that of other neurofunctional modules. The performance of a com¬ 
plex task depends on the collaboration of a number of independent neurofunctional 
modules, each capable of selective impairment. Each language system of a bilingual 
speaker is a complex subsystem comprising several modules. In addition, in the 
normal use of language, many other independent neurofunctional modules are neces¬ 
sarily engaged: pragmatic ability, metalinguistic knowledge, concepts, encyclopaedic 
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and episodic memory, as well as motivation. Conceptual mental representations are 
independent of language and, to the extent that the lexical-semantic constraints of 
items in each language differ from those of their translation equivalents, language 
subsystems are connected to some unshared conceptual features (Paradis 2000). 
There is not a single cerebral mechanism at work in the bilingual speaker that is not 
also operative to some extent in the unilingual brain. All speakers rely on the same 
cerebral substrates, albeit to different extents. There may be qualitative differences at 
the level of what is represented but there are only quantitative differences in how it is 
represented and processed. 
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PARTICLE PLACEMENT IN ENGLISH 
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A phenomenon that has attracted considerable interest in linguistics over the last 
decades is syntactic variation. I will investigate a particular instance of syntactic vari¬ 
ation, namely the word order alternation that is possible for English transitive phrasal 
verbs (TPVs). As an example, consider (1). 

(1) a. John picked up the book. construction 0 (C 0 ) 
b. John picked the book up. construction! (Ci) 

This alternation has frequently been referred to as Particle Movement. In this study, I 
will, as in Gries (1999,2000) use the term particle placement (PrtPl) in order to avoid 
the movement metaphor and its theoretical implications. 

Over the last 100 years, a large number of studies have been devoted to exploring 
this type of alternation. More precisely, they have attempted, first, to provide an ade¬ 
quate structural description of the two possible constituent orders and, second, to 
find the variables that determine native speakers’ choices governing the alternation 1 . 

Interestingly, a (at a superficial glance) simple word order alternation seems to be 
influenced by a large number of variables from many subdisciplines of linguistics. 
Table 1 (overleaf) gives an overview of variables that have been claimed to contribute 
to PrtPl. The middle column names the variable whereas the other columns provide 
the values/levels purportedly correlating with a preference for a construction. 

However comprehensive this list of variables may seem at first sight, there are 
three methodological issues I would like to address (for a conceptual critique of some 
of these variables, cf. Gries 2000). 

First, nearly all of the analyses have been monofactorial in nature. That is, most 
scholars have concentrated on one variable at a time and have attempted to support 
its assumed effect on PrtPl. Note that this does not mean they have not acknowledged 
that several variables are important; it only means that each variable’s effect has been 
investigated in isolation without concern for the simultaneous effect of other vari¬ 
ables. Let us consider an example. Fraser (1974:571) has argued that verbs without ini¬ 
tial stress entering into TPVs are preferred in construction!. In order to substantiate 
his claim, he provides the following minimal pairs and acceptability judgements: 

(2) a. ?I will insult back the man. 
b. I will insult the man back. 
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Value/Level for C 0 

Variable 

Value/Level for Ci 

stressed DO 

stress pattern of VP 

stressed particle 


phonetic shape of V 

no initial stress 


NP type of DO 

(semi-)pronominal 

definite 

determiner of DO 

indefinite / none 

long 

length of the DO 


complex 

complexity of the DO 


idiomatic 

meaning of the VPi 


inanimate 

animacy of the DO 2 

animate 

abstract 

concreteness of the DO 

concrete 

low 

entrenchment of the DO 

high 

long 

distance to last mention of the DO 

short 

low 

times of preceding mention of the DO 

high 

low 

cohesiveness of the DO to the preceding discourse 

high 

short 

distance to next mention of the DO 

long 

high 

times of subsequent mention of the DO 

low 

high 

cohesiveness of the DO to the subsequent discourse 

low 


following directional adverbials 

yes 

high 

production difficulty 

low 


Table i. Variables governing PrtPl (as suggested in the literature) 3 . 


(3) a. ?We converted over the heating to steam, 
b. We converted the heating over to steam. 

(4) a. ?Ihey attached up the tag on the wall, 
b. They attached the tag up on the wall. 

However, the question arises as to how Fraser knows that it is the phonetic form of the 
verb that is responsible for the purported preference of the (b) sentences rather than 
some other variables he also discusses in the same paper. The indicated preference of 
construction! in all three examples (if it is representative at all) could be due to the 
DO being simple, very short and definite rather than Fraser’s phonological variable. 
Obviously, the data Fraser cites simply do not warrant his conclusion as they fail to 
take into consideration these additional variables. This instance shows that, in com¬ 
plex cases like PrtPl, minimal pairs can distort the picture more than they are helpful 
in spite of their time-honoured place in linguistics. 

What is more, it is uncontroversial that, in complex phenomena (given the large 
number of variables mentioned above), independent variables can interact with each 
other with respect to the impact they have on the dependent variable. Such phenomena 
can hardly be identified by using classical minimal pair tests, which contributes further 
to the lack of an adequate description of PrtPl. Therefore, I advocate the use of multi¬ 
factorial techniques, which enable the researcher to describe PrtPl more efficiently. 
























PARTICLE PLACEMENT IN ENGLISH 


21 


Moreover, with few notable exceptions (Chen 1986; Gries 1999, 2000; Hawkins 
1991,1994) analyses of PrtPl have been based on intuitive and introspective analyses 
of grammaticality/acceptability judgements alone. This methodology may be found 
acceptable (and even rewarding) in particular research traditions (most notably trans¬ 
formational-generative grammar). From my point of view, however, this methodology 
is fundamentally flawed. To name one reason for this opinion, numerous publications 
have shown (c£, e.g., Labov 1975) that relying on grammaticality/acceptability judge¬ 
ments alone does not meet well-established standards of scientific research (objectiv¬ 
ity, validity, and reliability). Thus, my analysis will rely solely on naturally-occurring 
instances of PrtPl (from the British National Corpus). 

Second, in spite of many decades of research, there is still no account of PrtPl that 
tries to explain why speakers choose one construction over the other in a particular 
discourse situation. Such an account would, first, identify all relevant variables and 
subsume them under a (probably) small set of factors and, second, identify all irrel¬ 
evant variables on a principled basis and eliminate them from further consideration. 
There are several such analyses which have attempted to provide a unifying hypoth¬ 
esis for the main motivation of the alternation by relating PrtPl to issues of givenness/ 
topicality (which boils down to degree of activation) of the DO’s referent (Erades 1961; 
Chen 1986) or to purely syntactically conditioned processing effort (Hawkins 1991, 
1994). However, these previous investigations do not go far enough for two reasons, 
as follows. 

On the one hand, there are only two major empirical analyses of PrtPl (Chen 1986, 
and Hawkins 1991, 1994), and both suffer from several drawbacks. (The following 
discussion will only deal with two problems. For a more comprehensive critique, cf. 
Gries 2000). Both Chen and Hawkins operationalise the givenness of the DO’s refer¬ 
ent by counting co-referential expressions in the preceding context, a methodology 
advocated in numerous papers by Givon (cf, e.g., Givon 1983,1988). However, it has 
been shown beyond any reasonable doubt that the referent X of some expression is 
not only activated by strictly co-referential items. Rather, the inferential processes on 
the part of hearer in particular discourse situations also activate related concepts. 
While the notion of related concept can be criticised as being fairly vague, studies such 
as Clark (1977) on bridging and by Bolkestein (1985) as well as Bolkestein and Ris- 
selada (1987) on cohesiveness have shown that the referent X of some expression can 
in fact (depending on the particular characteristics of the discourse) be activated by 
hyperonyms of X, hyponyms of X, co-members of X’s level of categorisation, parts 
of X, the function of X, reasons leading to X, consequences following from X, etc. 
Therefore, analyses of the influence of givenness/topicality had better incorporate 
these variables if the investigation of givenness is to be valid. What is more, both the 
analyses by Chen and Hawkins have been carried out with little degree of statistical 
sophistication as, e.g., neither analysis uses any significance tests to support their far- 
reaching conclusions. 

On the other hand, Chens and Hawkins’ proposals also fail to account for a vari¬ 
ety of factors which cannot easily, if at all, be related to givenness/topicality or purely 
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syntactically conditioned processing effort, namely the idiomaticity of the VP, VP- 
final directional adverbials, and the concreteness of the DO, to name a few. Due to 
these and other drawbacks, an explanatory account of PrtPl that is both coherent and 
empirically supported cannot be found. 

Third, given the shortcomings just discussed, it comes as no surprise that it has 
not been possible to predict which construction a native speaker will choose. This 
is a problem that derives from the lack of truly multifactorial treatments of the sub¬ 
ject matter. 

In the remainder of this paper, I will propose a hypothesis that explains the con¬ 
structional choices of speakers given a particular discourse context (cf. section 2) and 
I will show how, by means of using multifactorial statistics, this hypothesis can be 
supported and native speakers’ choices of construction can be predicted with a high 
degree of accuracy (cf. section 3). 

1. the processing hypothesis. In this section, I propose a hypothesis to explain the 
distribution of the two constructions and outline which variables are, according to 
this hypothesis, relevant and which are not. 

The Processing Hypothesis (PH): By choosing one of the two constructions for an 
utterance U a speaker S formulates U in such a way as to communicate the intended 
message with as little processing effort as possible. More specifically, for most 
variables at least, this means that construction 0 will be preferred for verb-particle 
constructions (VPCs) with DOs requiring a lot of processing effort; construction! 
will be preferred for VPCs with DOs requiring little processing effort 4 . 

In order to investigate this hypothesis, two different steps are necessary. On the one 
hand, it must be shown how the above-mentioned variables relate to processing 
effort. This will be done in the remainder of this section. On the other hand, it must be 
shown empirically, that (i) the variables included by the PH correlate with PrtPl in the 
direction predicted above and (ii) the variables excluded by the PH do not correlate 
with or contribute to PrtPl; this will be the topic of section 2. 

1.1. phonological variables. The stress pattern of the VP can be straightforwardly 
related to processing requirements: in functional analyses of information structure, it 
has been useful to distinguish two kinds of information, namely given and new infor¬ 
mation. Generally, in English the new (and/or important) information is positioned 
sentence-hnally. Moreover, it is by now common ground that stress on a linguistic 
expression typically serves to indicate the newness or importance of the referent of 
this linguistic expression, i.e., to direct the attention of the hearer to the respective ref¬ 
erent, thereby increasing the processing cost associated with that referent. Combin¬ 
ing these two findings results in the distribution predicted in the PH: the expressions 
whose referents are to be processed thoroughly occur clause-finally and are stressed. 
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1.2. morphosyntactic variables. As far as the TPV (transitive phrasal verbs) is 
concerned, construction 0 facilitates processing because the speaker can complete 
(and thus disambiguate) the TPV earlier if the particle immediately follows the 
verb and need not be borne in mind until the DO has been uttered completely (cf. 
Rohdenburg 1996). But what about the DOs of TPVs? Let us start with the variable 
NP type of the DO. 

This variable is, in fact, quite closely connected to utterance processing: personal 
pronouns and semi-pronominal/referentially vague nouns are only used when their 
referents are identifiable and active, whereas lexical noun phrases are much more 
likely to be used with unused and brand-new referents. Again, the distribution is as 
predicted in the PH: active referents of personal pronouns require little processing 
effort and correlate with construction!, referents of lexical DOs are, on the whole, 
more likely to require more processing and occur preferably in construction 0 . 

Likewise, the variable determiner of the DO noun phrase is also concerned with 
processing aspects: 

Linguists traditionally deal with the binary distinction between definite and 
indefinite, with the former marking topics which the speaker assumes the hearer 
can identify uniquely, is familiar with, are within his file (or register) and thus 
available for quick retrieval. On the other hand, indefinites are presumably topics 
introduced by the speaker for the first time, with which the hearer is not familiar, 
which therefore are not available to the hearer readily in his file. (Givon 1983^.) 

Comment is hardly called for: definite determiners are said to prevail in construction! 
and indefinite determiners (used for unused or even brand-new referents requiring 
conscious activation) are said to prevail in construction 0 so that the pattern found is 
as expected. 

Length of the DO and complexity of the DO (irrespective of how these are mea¬ 
sured; cf. Wasow 1997) can be dealt with simultaneously. Obviously, longer/more com¬ 
plex NPs require more processing effort while shorter/less complex noun phrases can 
be more easily processed. But apart from this purely structurally motivated approach, 
there is also a functional principle at work: ‘the new information often needs to be 
stated more fully than the given (that is, with a longer, “heavier” structure)’ (Quirk 
et al. 1985:1361). Thus, if the newness of a referent, on average, renders DOs long and 
complex (in order to provide the necessary information for the hearer to establish a 
new referential file), the larger amount of linguistic material requires more processing 
effort than the one needed for given information. This, in turn, increases the likeli¬ 
hood that production and planning effects or mistakes such as hesitations, repeti¬ 
tions, and false starts can be observed in actual speech (cf. Arnold & Wasow 1996). 
Ultimately, both the length and its functional motivation go hand in hand so that 
again much processing effort is linked to construction 0 and little processing effort to 
construction!. 
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1.3. Semantic variables. We know that VPCs come in two different word orders, 
and we also know that the meanings of VPCs range from literal to idiomatic. In con¬ 
struction!, the particle is positioned in the canonical clause-final position for focal 
elements, so that the particle is processed more intensively than the DO. Thus, the 
word order of construction! naturally underscores the spatial contribution the parti¬ 
cle makes to the compositional meaning of the TPV in the utterance. Therefore, con¬ 
struction! is the natural choice for a speaker communicating a state of affairs where 
the spatial meaning is prominent 5 . 

Very often in the case of idiomatic constructions, the meaning of the TPV is not 
compositional. The particle does not just add some spatial information to a straight¬ 
forward sense of the verb (as with literal TPVs), but the idiomatic TPV conveys a 
meaning that is not a function of the meaning of its parts and their interrelations, 
but must be stored on its own. In other words, when the speaker accesses the mean¬ 
ing of the TPV for production, then the complete idiomatic TPV (i.e., verb and par¬ 
ticle) are simultaneously accessed. Thus it is only natural that the verb and the particle 
are uttered following one another directly: it would be uneconomical to process the 
opaque meaning of a TPV but produce the parts that trigger this opaque meaning in 
possibly widely disparate positions of the sentence 6 . 

Turning to the next variable, no significant effect of animacy of the referent of the 
DO on particle placement is to be expected according to the PH: there is no reason to 
assume that animate referents yield context-dependent processing requirements sub¬ 
stantially different from inanimate referents, and there is also no reason to assume that 
animate referents are more likely to undergo caused motion. Therefore, I will investi¬ 
gate animacy but consider it not to contribute to the constructional alternation. 

Finally, the variable entrenchment of the DO (as analysed in Gries 1999) will not be 
analysed directly because the entrenchment hierarchies offered in Deane (1992) and 
Gries (1999) comprise several variables that will be dealt with separately (and thus 
much more precisely) in this analysis. 

1.4. discourse-functional variables. The discourse-functional variables concern¬ 
ing the preceding context relate straightforwardly to matters of processing effort. 
Information that is given as a result of having (frequently and/or recently) been 
evoked in the preceding discourse or being readily inferable from the preceding 
context requires less processing effort than discourse-new or especially hearer-new 
information. Thus, the distribution of the two constructions (and their processing 
cost) relative to these variables is the one given above in Table 1. However, the 
discourse-functional variables concerning a subsequent context cannot be related 
to the processing cost of the VPC. Even if it were true that these variables measure 
the importance of the DO’s referent, it is difficult to see how a speaker should 
be able to foresee precisely the development of the discourse to follow (cf. also 
Hawkins 1994:225). Thus, I claim that these variables will neither be relevant to nor 
correlate with the processing cost of the utterance. 
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Spoken 

Written 

Row Totals 

Construction!, 

67 

127 

194 

Construction! 

133 

76 

209 

Column Totals 

200 

203 

403 


Table 2. Corpus data 7 . 

1.5. other variables. The next variable is concerned with the presence of a direc¬ 
tional adverbial following either the verb or the particle. If a directional adverbial fol¬ 
lows the VPC, then it typically serves to elaborate either the path along which the 
referent of the DO is being moved, (5)a, or the resultant location of the referent of the 
DO, (5)b and endnote 5. 

(5) a. So Tom took Peter along past the new Pump House, 
b. Fred put the book down on the table. 

For construction!, where the spatial meaning is foregrounded (cf. above section 
2.3), it is therefore quite natural to expect additional material (in the form of a direc¬ 
tional adverbial) providing additional information on the direction or the endpoint of 
the movement process. On the other hand, construction o does not normally denote a 
movement process that can be further elaborated with information concerning direc¬ 
tionality so following directional adverbials are, though not strictly ruled out, not to 
be expected. Thus, the distribution predicted in the PH seems to be fully justified. 

2. the statistical investigation: data and results. In order to empirically test the 
PH conforming more to the standards mentioned in section 1 ,1 studied a set of 403 
VPCs from the BNC. Table 2 gives an overview over the sentences that were analysed. 

By analysing the 403 sentences and their context (the 10 preceding and 10 subse¬ 
quent clauses) 8 , each VPC was assigned values representing the values/levels of the 
variables listed in Table 1 9 . Then, two different procedures were used: first, for each 
variable, a strictly monofactorial coefficient of correlation was calculated (section 
2.1). This may seem surprising, given that I have devoted some space to arguing that 
monofactorial analyses are ill-suited for problems of such a level of complexity. The 
reason for including monofactorial correlations is that I wanted to subject all vari¬ 
ables to an empirical test, as few variables have already been tested on naturally- 
occurring data. Second, a multifactorial analysis was conducted in order, first, to find 
out how much variance of PrtPl we can account for (given the variables we know 
of) and, second to try to predict native speakers’ choices of a construction in natural 
discourse (section 2.2). 

2.1. monofactorial results. Due to the different levels of the variables within the 
analyses, a variety of different coefficients of correlation had to be computed. To cut a 
long story short, Table 3 contains the correlation coefficients for each variable. 
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Variables that were analysed empirically 

Correlation coefficient 

Complexity of the DO (complex) 

y=-o.847 *** 

Idiomaticity of the verb phrase (idiomaticity) 

y=-0.6 *** 

Length of the DO in syllables (LengthS) 

r p bis=-0.504 *** 

NP Type of the DO (type) 

((>=0.492 *** 

Distance to last mention of the DO (ActPC) 

r P bis=o.452 *** 

Length of the DO in words (LengthW) 

r P bis=-o.45 *** 

DO’s cohesiveness to the preceding discourse (CohPC) 

r p bis-0.429 *** 

Times of preceding mention of the DO (topm) 

r P bis=o.4i5 *** 

Last mention of the DO (lm) 

<(>=0.411 *** 

Overall mention of the DO (om) 

r p bis=o.358 *** 

Concreteness of the DO (concrete) 

4 >=o .339 *** 

Determiner of the DO (det) 

((>=0.319 *** 

Directional adverbial following the DO (PP) 

((>=0.268 *** 

Times of subsequent mention of the DO (tosm) 

r P bis=o.i92 *** 

Animacy of the DO (animacy) 

((>=0.166 *** 

DO’s cohesiveness to the subsequent discourse (CohSC) 

r p bis=o.i43 ** 

Next mention of the DO (Nm) 

((>=0.104 * 

Distance to next mention of the DO (ClusSC) 

rpbis — 0.1 

Production and planning effects (disfluency) 

lpbis=-0.006 ns 


Table 3. Monofactorial results of the analysis. 


Generally speaking, the higher the absolute value of the correlation coefficient, 
the more important this variable is for PrtPl. But before we begin to interpret these 
results, a comment concerning the significant correlation between the variable ani¬ 
macy and PrtPl is necessary. This result might be taken as evidence that the PH is 
not fully correct since I predicted that animacy should not play a role. This, however, 
would be mistaken as the variable animacy not only tells us whether the referent 
of the DO is animate or not; it also tells us something about the concreteness of the 
DO: if the referent is animate, it must be concrete. Thus, if we want to test the influ¬ 
ence of Animacy alone, we need to take out the influence of concrete on animacy. 
There is a statistical technique serving this purpose, namely that of partial correla¬ 
tions, and if we do that, then Animacy does not have any influence on PrtPl anymore: 
r animacy construction • concrete — 0.04, t4oo — 0.757* P — 0.449 os. In less technical terms: 
animacy only had a significant effect because it tells us something about the con¬ 
creteness of the DO’s referent: if we take out that information, animacy turns out to 
be irrelevant while concrete remains important (as predicted by the PH). This is a 
good example of how monofactorial analyses can result in ‘statistical evidence’ that, 
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on closer inspection, turns out to result from the artificial separation of variables that 
are actually closely related. 

While it is not possible to go through all of the results in detail, some important 
conclusions can be drawn from the monofactorial analysis. First, we find a rough 
ranking of variable groups: morphosyntactic variables are the most important ones, 
followed by semantic variables and discourse-functional variables pertaining to the 
preceding context; least important are the discourse-functional variables concerned 
with the subsequent discourse. Second, on the whole, the PH receives strong support: 
the variables included by the PH all correlate with PrtPl, one, significantly and, two, 
in the predicted direction. Third, there are some variables (tosm, CohSC, nm and 
ClusSC) which correlate significantly with PrtPl although the PH has predicted oth¬ 
erwise (interestingly the correlation is exactly the opposite one than that observed by 
Chen). In this respect, the PH is not supported, but we will return to these variables 
below. Finally, we have, for the first time, a clue as to the strength of the individual 
variables in isolation, a level of precision not attained so far. 

The following section, however, will approach PrtPl in yet a different way. 

2.2. multifactorial results. While the results of section 2.1 are already a major 
leap forward (given the absence of similarly thorough analyses so far), they are still 
not quite satisfactory. The problem is, as was already mentioned, that monofactorial 
analyses do not enable us to address the issue rewardingly. Consider the choice of 
construction from the perspective of speech production: Obviously, no given native 
speaker computes a series of monofactorial correlations or weighs variables indepen¬ 
dently in order to reach a decision as to the construction to be uttered (although the 
method of monofactorial analyses seems to imply just that). Rather, for the native 
speaker, all variables’ values/levels are present at the point of time where the construc¬ 
tional choice takes place. Thus, any cognitively realistic account needs to incorporate 
all the variables simultaneously rather than artificially isolated. 

Let us first try to measure the overall success of the research on PrtPl in a mul¬ 
tifactorial way. Using the General Linear Model (a generalisation from techniques 
such as regression analysis and analysis of variance), we can assess the amount of 
variance of the dependent variable PrtPl that can be explained both by all variables 
ever postulated and, more interestingly for our purpose, for the PH. Consider Table 4 
(overleaf) 10 . 

In sum, the adjusted multiple correlation coefficient R c for the variables included in 
the PH is not only highly significant; it is even slightly higher than R c for all variables 
ever postulated to influence the alternation. In less technical terms, if we include vari¬ 
ables into the analysis other than those required by the PH, what we get is random noise. 
Thus the PH receives in this case overwhelming support. However, we would still like to 
know exactly how the variables behave and how we can predict speakers’ choices. 

With a so-called discriminant analysis, we can enter all variables into an equation 
and figure out how important each variable is (by means of a factor loadings) and 
what choice of construction the analysis would predict. Table 5 provides the results 
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Statistical index 

All variables 

Variables of the PH 

R 

0.79 

0.761 

Rc 

0.719 

0.723 

F 

F94, 308=44 

F53.349=9.074 

P 

<0.001 *** 


Table 4. Multifactorial results of the analysis. 


Variables of the discriminant analysis 

Loading 

Effect on PrtPl 

Length of the DO in syllables 

-0.545 

high variable values —> 
construction,) 

low variable values -4 
construction) 

Lexical DO 

-0.496 

Intermediate complexity of the DO 

-0.477 

Length of the DO in words 

-0.47 

Idiomatic VP 

-0.323 

Indefinite determiner of the DO 

-0.281 

Production/Planning difficulty 

-0.006 

given the low factor loadings (fl) 
(-o.22<fl<o.22), these variables 
do not discriminate well 

between the two constructions 

Semipronominal DO 

0.086 

Distance to next mention of the DO 

0.093 

Is there a next mention of the DO? 

0.097 

DO’s cohesiveness to the subs. disc. 

0.134 

Animacy of the DO 

0.157 

Times of subs, mention of the DO 

0.183 

No determiner of the DO 

0.222 

high variable values —> 
construction) 

low variable values —» 
construction,) 

VP-final directional adverbial 

0.277 

Literal VP 

0.308 

Concreteness of the DO 

0.336 

Is there a last mention of the DO? 

O.42 

Times of prec. mention of the DO 

0.426 

DO’s cohesiveness to the prec. disc. 

0.443 

Distance to last mention of the DO 

0.473 

Pronominal DO 

0.494 

Simple DO 

0.571 

Wilks’ Lambda=o.465; canonical correlations.73; p<o.ooi *** 


Table 5. Results of the discriminant analysis for all variables. 


Composition of the 
learning sample 

Composition of the 
prediction sample 

Correct predictions 

200 oral + 150 written sentences 

53 written sentences 

81.1% 

150 oral + 200 written sentences 

53 oral sentences 

67.9% 

174 oral + 176 written sentences 

26 oral + 27 written sentences 

88.7% 


Table 6. Prediction accuracy of three analyses. 
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of a discriminant analysis for all variables (nominal and ordinal variables were 0, 
1-coded; variables failing the tolerance test were omitted). 

Given space limitations, three findings deserve attention: first, again all variables 
included into the PH correlate with PrtPl strongly and in the predicted direction. 
Second, the overall ranking of the variable groups found in the monofactorial analysis 
is preserved. Lastly, the most realistic multifactorial analysis further supports the PH 
in that all the variables that I have argued to be irrelevant (namely animacy and all 
the discourse-functional variables concerned with the following context) do indeed 
get only very low factor loadings. 

What is more, when we perform a second discriminant analysis including only the 
variables of the PH, then we can test how accurate the predictions of constructional 
choices from the PH really are. If we enter all 403 cases into the analysis, the discrimi¬ 
nant analysis computes a constructional choice, and we can compare the accuracy 
of this classification with the ‘real’ choices by native speakers in their discourse set¬ 
tings. The classification accuracy resulting from this procedure is very high: 85.9% 
of all the 403 utterance are classified correctly. However, this is not really prediction 
since the cases to be classified also figured in the analysis from which the discrimi¬ 
nant function was derived. Therefore, I have pursued two ways to further support 
the argument of the predictive power of the PH. 

First, I split the sample into two parts, one consisting of 350 cases, the other of 53 
cases. The former sample was a learning sample to which I applied a discriminant 
analysis to obtain a discriminant function; the latter sample was a prediction sample 
whose constructions were predicted on the basis of the discriminant function derived 
from the learning sample. In order to anticipate criticism of my possibly biased choice 
of the samples, I performed this test three times with different learning and predic¬ 
tion samples. Table 6 shows the composition of the samples and the results. 

Obviously, the cross-validation also strongly supports the PH: on the whole the 
constructional choices in the prediction sample can be predicted quite accurately on 
the basis of the learning sample (with oral data being most difficult to predict, which 
is, I think, due to the more spontaneous and interactive nature of oral speech as com¬ 
pared to planned writing). 

However, the ideal way of testing the predictive power of models is via cross- 
validation, using the leave-one-out method (also called U-method). If we apply this 
procedure to the present data, we perform 403 analyses in each of which we predict 
the choice of construction in a single case on the basis of the remaining 402 cases. This 
again guarantees that no case is used for its own prediction. The result of this cross- 
validation for the present analysis is a prediction accuracy of 83.9%. This result is for all 
practical purposes very much the same as the one we had for the classification accuracy 
and the split-sample technique, which shows that the results are quite robust and the 
predictive power of all the variables together is indeed exceptionally high. 

3. discussion/conclusion. We have seen that the Processing Hypothesis subsumes 
more variables under a single explanatory account than any other previous hypoth- 
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esis. While the present analysis is not the first one to relate PrtPl to processing cost 
(cf. Hawkins 1994), it is important to notice that it is the first one that integrates all 
previous variables rather than excluding them a priori. We have also seen that the 
PH excludes several variables from further consideration. Both of these predictions 
are borne out by the data to such an extent that we can now, on the basis of an argu¬ 
mentatively sound hypothesis correctly predict 84% of native speakers’ choices for a 
particular construction in a natural discourse setting. 

One question, however, remains unanswered. The PH was neutral with respect to 
whose processing effort we were concerned with although it would be preferable to be 
able to explicitly relate the processing cost associated to the variables under investiga¬ 
tion to one or both of the interlocutors. However, as Arnold et al. (2000) have shown, 
this is hard to achieve since what is beneficial to the speaker is very often beneficial 
to the hearer, too. To name just one example, long complex DOs require more pro¬ 
cessing for the speaker, so he might prefer it to process them once everything else 
has already been processed. At the same time, hearers also prefer the sentence-final 
position for complex DOs in order to reduce their memory load in parsing. Thus, 
the question of shoes processing cost we are primarily concerned with awaits further 
(experimental) research. 

More importantly, however, are probably the methodological ramifications. The 
study has shown that multifactorial analyses of naturally-occurring data can go way 
beyond previous kinds of analyses. Not only are these techniques more likely to yield 
cognitively realistic results, they also enable us to identify spurious correlations (i.e., 
statistical artefacts, recall the variable animacy) and, last but not least, compare the 
predictive power of competing analyses. Suppose that some researcher claims to have 
found additional variables influencing PrtPl or a hypothesis making different pre¬ 
dictions. In that case, we simply analyse a set of naturally-occurring sentences with 
respect to all variables (his/hers and mine) again and test which hypothesis results in 
a higher prediction accuracy. In the absence of additional indications to the contrary, 
the analysis with a higher degree of predictive power is to be preferred. In conclusion, 
I hope to have also shown how rewarding a truly multifactorial perspective on the 
analysis of syntactic variation can be. 


Here as well as in the remainder of the paper, the expressions choice of construction or 
speakers’ decisions are not meant to imply that there is necessarily a conscious choice on 
the part of the speaker. 

Of course, animacy and concreteness of the DO refer to the animacy and the concreteness 
of the referent of the DO. 

It is also interesting to note in passing that nearly all of these variables have already been 
identified single-handedly by Van Dongen (1919). Put differently, not much progress has 
been made since then as, unfortunately, many analyses have failed to notice this and other 
early grammarians’ insightful works. 
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4 This formulation leaves open the question of whose processing effort we are concerned 
with. This question is not relevant to the subsequent discussions, but it will be addressed 
briefly in section 3. 

5 Construction, apparently instantiates a subtype of the caused-motion construction in 
Construction Grammar (cf. Goldberg 1995 chapters 3 and 8), which has the following 
basic meaning: ‘the causer argument directly causes the theme argument to move along 
a path designated by the directional phrase; that is, X causes Y to move Z’ (Goldberg 
1995:152). Similarly, in TPVs like, say, Fred brought the book back, the causer {Fred) directly 
causes the theme argument (the book) to move along a path or up to a point designated by 
the directional phrase (back). For this reason, it also follows that concrete referents of DOs 
will preferably occur in construction, while abstract objects probably will not: concrete 
referents can undergo caused motion whereas abstract referents cannot. Finally, note that 
Goldberg’s caused-motion construction is metaphorically related to the resultative con¬ 
struction, which fits previous observations that TPVs can also license a resultative mean¬ 
ing as well. 

6 This claim is supported by the independent observation that idiomatic expressions are in 
general much less susceptible to syntactic rearrangements than literal expressions. 

' The majority of the VPCs investigated consist of the most frequent verbs and particles 
entering into VPCs. The required frequency data in turn result from my own collection of 
1,357 different TPVs. 

The question may arise why ‘only’ 403 examples were used for the analysis. First, it 
has to be observed that, with 403 cases, this is by far the largest quantitative corpus-based 
analysis of particle placement ever reported (cf. Hawkins’ (1994) analysis of a mere 179 
cases or Chen’s (1986) analysis of only 239 cases). Additionally, the results will show that 
the predicted effects are all quite strong and highly significant, supporting my claim that 
the number of cases is in fact not too small at all. 

8 In order not to overly minimise the context, I did not count questions tags, discourse 
markers (e.g .you know, I mean) and repetitions / false starts. 

9 The variable semantic modification of the particle (with words like right) was not investi¬ 
gated because the corpus data did not contain a single example of such cases. The influ¬ 
ence of stress was not investigated since the available corpus data were not phonologically 
annotated. 

10 The analysis included two- and three-way-interactions; full-factorial designs of this com¬ 
plexity are difficult to calculate and even more difficult to interpret. 
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introduction. In the study of word-formation, the possibility of using large text 
corpora has in the last few years opened whole new avenues of research. Whereas ear¬ 
lier studies have pointed out and explained different kinds of phenomena and types 
of word-formation, it has been somewhat difficult to present in an exact and detailed 
manner the extent and proportion of e.g. changes in a language, let alone to prove 
the validity of some of the propositions made in this field (sometimes possibly based 
on intuition plus only a few examples). The historical aspects in some of the basic 
textbooks of word-formation, for example, Marchand’s (1969), are heavily based on 
findings from the Oxford English Dictionary, and at the time of the books release, 
Marchand’s perceptive observations undoubtedly seemed to have a very solid base. 
Now that we have the ability to do more computerized searches from large corpora 
and thus to focus on even the smallest units in detail, the future will no doubt see, to 
some extent, a more in-depth rewriting of the basics of English word-formation and, 
as Kennedy (1998:6) has pointed out, altogether new theoretical generalisations. 

From the viewpoint of corpus linguistics, word-formation seems to be a field 
where the topics of research may set rather high requirements for corpus size in order 
to achieve scientifically valid findings. What is a suitably large corpus depends, of 
course, very much on what one is investigating. Some word-formation types (e.g. 
blending) can be difficult to examine because of the relative infrequency of such 
words in almost any collection of texts. Suffixation in general is an issue that has been 
examined from different aspects (see e.g., Biber et al. 1998:63), but studies on the use 
of individual suffixes are fewer. In this paper I will outline earlier studies on the varia¬ 
tion between adjectives ending in -ic and -ical, and introduce my current work on the 
topic, with special attention given on their use in the Early Modern English period. 

1. the suffixes -ic and -ical. The incidence of the suffixes -ic and -ical presents a 
complex picture in the vocabulary of present-day English, and there are cases in which 
both suffixes are used in connection with the same root, as in analytic/analytical and 
philosophic/philosophical. The existence of such competing adjective pairs has been 
observed by several grammarians, one of the first being James Elphinston in 1765, 
who comments on some differences between the general semantic content of the two 
endings. The issue has been looked into by late 19th century grammarians and 20th cen¬ 
tury lexicographers, but at closer examination it becomes evident that the question of 
when each variant should be used is unanswered. The aim of my research is to study the 
ways in which such adjectives have been used in different times, and how their use has 
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changed, describing the different currents of change and the linguistic factors involved 
(syntactic, semantic, etc.) and also to see why such changes took place. 

The models for the English suffixes -ic and -ical were the French words ending in 
-ique, Latin words ending in -icus and -icalis, and Greek words ending in -ikos. The 
first words with the suffix -ic appeared in Old English, and the -ical formations appear 
in the Middle English period. The present-day situation of free variation between 
some adjectives in -ic and -ical (e.g., in the British National Corpus, the numbers of 
occurrences for pedagogic and pedagogical are roughly equal, and there seem to be no 
apparent differences as to their meaning or collocations) is a result of the fact that in 
the Early Modern English period, both endings were used to produce new adjectives, 
the functions of the endings being almost identical (‘of, belonging or pertaining to, 
resembling X’). 

Earlier accounts on the developments in the use of -ic and -ical may seem somew¬ 
hat confusing: for example, some of Marchand’s comments suggest that forms in -ical 
would be in general more common than the -ic forms, and in the case of competing 
pairs, the longer forms are ‘stronger’. Tie also states that, as a result of competition bet¬ 
ween two synonymous adjectives in -ic and -ical, the shorter forms are usually the ones 
that have been thrown out (1969:242). These comments are in opposition to Earle’s 
(1892:404) observations, according to which the appearance of both nouns and adjecti¬ 
ves in -ic caused‘a demand for an adjectival form which should be equivocal’, resulting 
in ‘a strain of adjectives in -ical which is one of the notes of the literature of the sixteenth 
and seventeenth centuries, and which has been largely discarded in recent times’. In 
addition, it has been claimed that the more recent the coinage, the more likely it is to 
have the -ic rather than the -ical ending (e.g.. Swan 1980:302). 

1.1 productivity of the suffixes. To investigate the productivity of -ic and -ical 
in different periods, I studied the cd-rom version of the OED, and the first citations 
of -id-ical adjectives, modelled after Bauer’s similar examination of the diachronic 
productivity of -merit (1998:417-19). One can agree with comments on the dangers 
of inaccuracies in such a research, but it can nevertheless be assumed that the result¬ 
ing patterns are indicative of the actual productivity (see Gorlach 1991:171; Bauer 
1998:417). Given the very large number of -id-ical adjectives in the OED, it was con¬ 
sidered sufficient to examine a small fraction of them. The numbers of first citations 
of -id-ical adjectives beginning with the randomly chosen letters a, c,f and u are 
presented in Figure 1 1 . 

The first rather peculiar observation one can make from this figure is that both 
curves have two peaks, which is also what Bauer (1998:418) discovered with the suffix 
-merit. Bauer concludes from the two-peaked figure of -merit formations that‘produc¬ 
tivity may change over time in ways that are not directly related to the number of 
available bases’. The changes in the productivity seem to correspond with the overall 
numbers of new words coined in these periods: Gorlach (1991:137-38) explains that 
the reasons for the growth period of the English lexicon between 1570 and 1630 were 
‘the need to express new ideas in English’ and‘the subjective desire to enrich the rhe- 
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Figure 1. The numbers of first citations of words with the suffixes -ic and -ical in the OED. 

torical potential of the vernacular’. He also remarks that as a result of this period, 
characterized by linguistic experimentation, there was a great deal of redundancy 
‘often on the basis of competing derivational patterns’. 

It can further be observed from Figure 1 that between 1500 and 1700, roughly the same 
number of new adjectives were coined with -ic and -ical endings. Considering especially 
the latter half of the 16th century, it can be said that the increase in the coinages in -ical 
formations began before that of the shorter forms. During the latter half of the 17th cen¬ 
tury, new words in -ic were coined at an increasingly faster rate, and slightly more new 
words in -ic were recorded than coinages in -ical, but the longer suffix still seemed to be 
productive. In the first half of the 18th century, the numbers of new coinages drop for 
both suffixes, which again follows the numbers of all new coinages between 1680 and 1780, 
as reported by Gorlach (1991:137). In the 19th century, -ic becomes dominant over -ical as 
far as new coinages are concerned, to a great deal because of its high productivity in new 
scientific terms, especially in chemistry (e.g., acidic, allylic, cresylic, cyanic, etc.), with -ical 
forms also resulting in a slight second peak. 

1.2. competition between the suffixes. In the case of competing adjective forms, 
language has been said to change so as to rid itself of superfluous items, either by 
discarding one of the forms or by assigning the two forms differing senses (Fowler 
1926, s.v. -ic, -ical; Marchand 1969:241-2). Some comments have been made on the 
preference for either -ic or -ical in certain types of adjectives: e.g., according to 
Sweet (1891:490), -ic is preferred in forming adjectives from proper nouns and names 
of races, as in Aristophanic, Byronic, and Slavic. Marchand (1969:243) singles out a 
group of adjectives mainly in -ic consisting of words with the basic sense ‘queer, 
odd, spleeny’, including, e.g ,,frenzical, nonsensical, and quizzical. In general, whenever 
there are pairs of adjectives in -id-ical, it has been claimed that the forms in -ic are 
semantically closer to the base word, whereas with the -ical forms the connection is 
vaguer (Fowler & Fowler 1951, s.v. -al), and based on this idea Marchand (1969:242) 
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suggested that scientists are more prone to use the shorter forms than non-scientists 
(see also Zandvoort 1966:314-5). The first grammarian to comment on the -id-ical 
variation was Elphinston, who said that ‘ ic is a foreign, and ical a domestic termi¬ 
nation. The former therefore is used upon solemn, the latter on familiar occasions; 
as seraph seraphic or seraphical, microscope microscopic or microscopical. Where the 
subject then is naturally solemn, the solemn ending prevails; and where familiar, the 
familiar. So we say almost only majestic, miltonic, from majesty, Milton, &c. and whim¬ 
sical, finical, from whimsy, fine, &c. (Elphinston 1765:323-24). All these comments 
are worthy of closer examination, both from a diachronic and synchronic viewpoint. 
More recently, studies have been also made on semantically differentiated pairs in 
-id-ical, including Marsden (1985) and Kaunisto (1999). 

Gorlach comments on the linguistic experimentation of 1570-1630 that ‘this pro¬ 
liferation was not cut back until the late seventeenth/eighteenth centuries, as a 
consequence of natural selection or as a result of grammarians’ or lexicographers’ pre¬ 
scriptivism’ (Gorlach 1991:138). As mentioned earlier, in addition to the ending of the 
‘proliferation, redundant items were dropped from use: Earle (1892:405) observed in 
his discussion on discarding forms in -ical (e.g., domestical, Germanical, magnificat) 
that this practice as regards -id-ical adjectives was not uniform, saying that ‘it almost 
excites a surprise to find that after all we have been rather arbitrary in our discontinu¬ 
ance of some, while we have continued to use others whose case is nowise different’, 
examples of surviving forms being periodical, statistical, surgical, etc. 

We could then raise the following questions: 

1. Is there anything to explain which of the adjective pairs were ‘dealt with’ by either 
dropping one of the forms or by semantic differentiation? Why would, for exam¬ 
ple, semantic differentiation have been a better or more natural way to get rid of 
redundancy in the case of some adjectives than with some others? 

2. If one of the forms is dropped from use, why is it that with some adjectives 
it is the form in -ic (e.g., we do not usually use the form mathematic, but 
mathematical), and then with some others the -ical form is dropped (e.g., 
tragic instead of tragical )? 

3. How drastic or quick were the changes in the use of such adjective pairs? 

2. THE USE OF THE SUFFIXES IN THE CHADWYCK-HEALEY LITERATURE ONLINE 

corpus. To investigate the changes in the use of -id-ical adjective pairs in earlier 
centuries, the most suitable material to use at the moment is the prose section of the 
Chadwyck-Healey Literature Online text collection, a commercially available corpus. 
This is simply because there are no other corpora available from this period that 
would be large enough—the prose section of the Literature Online contains altogether 
approximately 33 million words 2 . What needs to be mentioned is that the choice of 
this material for the study means, of course, that we are only dealing with fiction, 
and the results may not be exactly the same if texts of other genres were examined. 
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majestic 

majestical 

Number of authors 
using both forms 

1550-99 

1 (1) 

14 (10) 


1600-49 

15 ( 7 ) 

19 (8) 

2 

1650-99 

65 (15) 

3 ( 3 ) 

1 

1700-49 

33 ( 9 ) 

— 


1750-99 

70 (21) 

— 


1800-39 

149 (22) 

2 (2) 

2 


Table 1. The numbers of occurrences o/majestic/majestical in Literature Online. 

The prose section of the Chadwyck-Healey collection covers the period between 1513 
and 1839. As the corpus was obviously not compiled especially to meet the requirements 
of linguistic corpus research, it may not be ideal as regards its structure. The use of -icl- 
ical pairs in the corpus was analysed in blocks of 50 years. It is important to note the 
numbers of books and authors included in the corpus: the numbers of books in dif¬ 
ferent 50 year blocks varied from 43 to 62, and the numbers of authors from 24 to 35 
(a curious exception is the period 1700-49, which has 55 books by only 12 authors). In 
addition, texts by anonymous writers were not included in the analysis. Partly as a result 
of this decision, the number of books/authors in the corpus from the first half of the 
16th century was so small that this period was left out of the analysis entirely. 

After making online searches of -id-ical adjectives, based largely on Lehnert’s 
(1971) reverse dictionary, I studied the use of those adjective pairs which were the 
most frequent in the corpus, i.e., occurring frequently enough so that observations 
could be made on them. Thus the analysis did not consider adjectives which did not 
have an existing counterpart in -ic or -ical. The analysed adjectives include angelic(al), 
authentic(al), comic(al), domestic(al), fantastic(al), heroic(al), magic(al), majestic(al), 
philosophic(al), poetic(al), tragic(al), and tyrannic(al). Of these adjectives, authentic/ 
authentical had the smallest numbers of occurrences, with 80 instances of authentic 
and 27 instances of authentical. Pairs with a smaller number of total occurrences in 
the corpus were excluded from the analysis. It has to be noted again that if other 
genres were examined, the most frequently occurring -id-ical adjective pairs could 
perhaps be different. 

Firstly, attention could be paid to the numbers of occurrences of majestic/majestical 
in the Literature Online corpus, presented in Table 1. In Table 1, the numbers given under 
the adjective forms denote the absolute number of occurrences of the forms in differ¬ 
ent fifty-year periods, and the numbers in parentheses denote the numbers of authors 
using the adjectives. On the right hand side of the table are the numbers of those 
authors who were using both forms. As far as the semantics of the adjectives here 
are concerned, both majestic and majestical were used in the same meanings, as can 
be seen in examples (1) a and b (see also OED, s.w.), found in the Literature Online 
corpus: 
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authentic 

authentical 

Number of authors 
using both forms 

1550-99 

— 

11 (6) 


1600-49 

3 (2) 

10 (6) 


1650-99 

24 (11) 

6 ( 3 ) 


1700-49 

16 (5) 

— 


1750-99 

15 (11) 

— 


1800-39 

22 (9) 

— 



Table 2. The numbers of occurrences of authentic/authentical in Literature Online. 

(1) a. Though her Beauty had (while present) suspended Cha-abas Passion 
towards the Sultana for that time, he had no sooner retired from her 
Company, but that the Sultana presented her self, to his Imagination, with 
all those Perfections, and that Maiestick Grace which always accompany 
her. (Peter Bellon, The Court Secret (1689), 332) 
b. .. .they arriued shortly at the place of triumph, into which entring, bowing 
her selfe vnto her horse necke, vnto those she first met, she passed her 
horse by ye tilt, with such a countenance, and maiestical grace, that euery 
one admired, both her persone, and good demeanour... (Henry Roberts, 
Honovrs Conquest (1598), 137) 

In the periods when both forms were actively used, there were no signs in the texts 
of clear preference for the use of either form in any particular sense or collocation. 
Similarly, there seemed to be no significant difference as to the positioning of the 
adjectives: both forms appeared roughly with the same relation in attributive and 
predicative positions. It can be seen from Table 1 that in the latter half of the 16th 
century, the -ical form was more common—and this seems to be the case with most 
of the -id-ical adjective pairs examined—but later in the 17th century, the use of the 
shorter form became more popular, and the -ical forms less frequent towards the end 
of the 17th century, finally almost disappearing. In present-day English, majestical is 
indeed quite rare. So here we would have evidence of a clear shift from -ical to -ic 
forms, with no apparent trends of semantic differentiation, but merely one of drop¬ 
ping the forms in -ical from use. 

Interestingly enough, the same kind of change—both as regards the synonymous 
use of the forms and the time when one can perceive a shift in favor of the shorter 
forms—can be seen, for example, with the adjective pairs authentic/authentical and 
heroic/heroical, as is evident from Tables 2 and 3. 

A notable feature in Table 3 is the numbers of authors using both forms, espe¬ 
cially in the period 1600-49, when almost all of the writers who used the adjectives 
heroic/heroical were using them interchangeably, as e.g., Alexander Hart in examples 
(2) a andb: 
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heroic 

heroical 

Number of authors 
using both forms 

1550-99 

5 (2) 

48 (15) 


1600-49 

26 (8) 

36 (7) 

6 

1650-99 

79 (17) 

40 (7) 

6 

1700-49 

69 ( 9 ) 

3 (1) 

1 

1750-99 

141 (21) 

3 ( 3 ) 

3 

1800-39 

145 (20) 

3 ( 3 ) 

3 


Table 3. The numbers of occurrences o/heroic/heroical in Literature Online. 



tragic 

tragical 

Number of authors 
using both forms 

1550-99 

16 (7) 

69 (19) 

5 

1600-49 

17 ( 7 ) 

61 (11) 

5 

1650-99 

12 ( 7 ) 

51 (18) 

3 

1700-49 

20 (4) 

34 (8) 

4 

1750-99 

38 (10) 

53 (16) 

5 

1800-39 

79 (n) 

27 (13) 

7 


Table 4. The numbers of occurrences of tragic/tragical in Literature Online. 

(2) a. This grave Senator had a Sonne who bearing his name, disagreed not from 
his properties, but punctually had his Fathers heroicke inclinations, which 
made him as happy as meritorious, and was no lesse honoured of all 
men... (Alexander Hart, Alexto and Angelina (1640), 19-20) 
b. ... Sandrico for the better satisfaction of his friend and himselfe, begun to 
enquire and demand of the Inhabitants what they could informe him con¬ 
cerning the heroicall inclination of this Roman Goddesse,... (Alexander 
Hart, Alexto and Angelina (1640), 56) 

It seems justifiable to say then that with some adjective pairs ending in -ic and -ical, 
there was a noticeable shift from the use of -ical forms to the -ic forms in the 17th 
century, although during this period, as could be concluded from the OED chart in 
Figure 1, new words in -ical were still coined. This is particularly interesting in the 
light of the comments made by Earle (1892), as the tendency to discard the existing 
forms in -ical appears to have started earlier than he had assumed. In fact, it appears 
to have begun before the use of the suffix as a productive derivational pattern had 
gone out of fashion. 

On the other hand, we do not get similar results to these when examining pairs such 
as tragic/tragical and tyrannic/tyrannical, as seen in Table 4 and Table 5 (overleaf), con¬ 
forming with Earle’s observation of the adjective pairs’ behaviour not being uniform. 
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tyrannic 

tyrannical 

Number of authors 
using both forms 

1550-99 

— 

28 (8) 


1600-49 

— 

2 (2) 


1650-99 

7 ( 4 ) 

46 (13) 

1 

1700-49 

8 ( 3 ) 

19 (6) 

2 

1750-99 

9 (8) 

44 (20) 

7 

1800-39 

12 ( 5 ) 

44 (12) 

4 


Table 5. The numbers of occurrences of tyrannic/tyrannical in Literature Online. 


philosophic philosophical Number of authors 

using both forms 

1550-99 



12 

(5) 


1600-49 



6 

(3) 

6 

1650-99 

1 

(1) 

23 

(8) 

6 

1700-49 

4 

( 4 ) 

21 

( 7 ) 

1 

1750-99 

54 

(19) 

39 

(18) 

3 

1800-39 

45 

( 9 ) 

78 

( 17 ) 

3 


Table 6. The numbers of occurrences of philosophic/philosophical in Literature Online. 

In the case of tragic/tragical, the longer form is in present-day English rare when 
compared to tragic. From a temporal viewpoint, however, the developments with this 
word pair do not seem to go along the same lines as with the adjective pairs discussed 
earlier, as the form in -ical appears to have survived well into the 19th century. With 
tyrannic!tyrannical, the -ic form enters the scene in the latter part of the 17th century, 
but the status of the longer form never became threatened, and today tyrannical is 
clearly the favored form. 

Even more interesting changes can be found in the use of philosophic/philosophical, 
the numbers occurrences given in Table 6. In the latter half of the 18th century, the 
shorter form became more common, but during this period the two forms were 
used differently: philosophic in this period was more often used in a wider, more 
popularized meaning, appearing in phrases such as a philosophic manner or a philo¬ 
sophic look/expression. When referring to things pertaining to philosophy as a sci¬ 
ence, with collocating words such as research, theories or argumentation,philosophical 
was clearly the preferred form. After that period, however, this semantic distinction 
began to disappear, and philosophical became the more common form in all senses 
used—and in present-day English, one can still occasionally come across instances 
of the shorter form, but philosophical is considerably more frequent. In this case we 
would then actually encounter both processes of getting rid of redundancy, semantic 
differentiation and dropping one of the forms from use. 
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One may wonder why adjective pairs seem to differ so much in their behavior as 
regards either differentiation between the forms or dropping one of the forms. If there 
was a shift from -ical to -ic in the 17th century, why did it affect only some adjective 
pairs, and not all of them? One possibility could be in the meanings of the base words, 
as proposed by Elphinston (1765). While nothing in the use of the examined adjec¬ 
tives in the corpus suggests that the use of either the -ic or -ical form depended in any 
way on the ‘familiarity’ or ‘solemn nature’ of the occasion (as Elphinston also argued), 
the semantic aspects of the words themselves do provide a common feature. For exam¬ 
ple, angelic(al), heroic(al), and majestic(al) seem to have more in common with each 
other than with philosophic(al) or poetic(al), the shared component being the semantic 
aspect of nobility. If this indeed were the case, an interesting conclusion would be that 
the meaning, the solemn nature of the base word itself might have had an effect in the 
choice of how to get rid of redundancy: not semantic differentiation, but dropping 
the forms in -ical from use. Although Elphinston did not specifically comment on the 
language actually changing as regards the use of the two suffixes, his observations on 
the matter—somewhat surprisingly not referred to since by lexicographers or gram¬ 
marians—would seem to find partial support from the material examined. 

Based on the Literature Online material alone, it might be early to suggest clas¬ 
sifications of -id-ical adjectives according to the changes in their use, but presented 
below is a preliminary suggestion of such a classification, showing that adjectives 
having the semantic component of nobility could, in fact, be observed as behaving in 
the same way. This tendency is further strengthened in the corpus by the adjective 
pairs prophetic(al) and seraphic(al), which, although not as frequent as the other pairs 
in group 1, nevertheless show a similar shift from -ical to -ic. 

I. shift from -ical to -ic in the 17th century or early 18th century— 

NO SEMANTIC DIFFERENTIATION 

a. base word has the semantic characteristic of nobility or grace: angelic(al), 

authentic(al), heroic(al), majestic(al), prophetic(al), seraphic(al) 

b. others: domestic(al) 

II. # of occurrences in -ic increases (in the 18th century)— 

semantic differentiation —both forms remain in use 

a. the form in -ic stronger/usual: comic(al), fantastic(al), magic(al) 

b. the form in -ical stronger/usual: poetic(al) 

III. # of occurrences in -ic increases, but the -ical form remains the usual form- 

NO SEMANTIC DIFFERENTIATION 

tyrannic(al) 

IV. the form in -ic becomes more frequent— semantic differentiation— 

followed by a shift back to - ical , with no semantic differentiation 

philosophic(al) 

V. shift from -ical to -ic after the first half of the 19th century— 

NO SEMANTIC DIFFERENTIATION 

tragic(al) 
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3. conclusion. It could be said that obviously more research has to be done on the 
-id-ical adjectives in the Early Modern English period as well as other periods so as to 
include more adjective pairs in the classification—and there would probably be even 
more classes than the ones suggested here. The findings from the Literature Online 
corpus does not obviously give us answers for all of the questions about the use of 
-id-ical adjectives, but they do suggest that there have been a number of different cur¬ 
rents of change. The observation made on adjectives such as angelic(al), heroic(al), 
and majestic(al), sheds new light on the matter of competition between these two der¬ 
ivational patterns in that in cases where both forms have already existed, the semantic 
aspects of the adjectives themselves may be considered to have affected the applica¬ 
tion of one of language’s ‘self-purifying’ tendencies, in this case attrition instead of 
differentiation. 


Excluded from the investigation were of course those words which actually were prefixed 
forms of already existing words in -id-ical. 

This figure refers to the prose fiction databases of the Literature Online as they were in 
February 2000—the 19th century fiction database has been since completed to include 
works from 1840 to 1900. 
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introduction. The treatment of demonstrative pronouns in French is usually simple 
and uncontroversial. This statement by Grevisse (1986:1054) is representative of the 
analyses found in traditional grammars, as well as early linguistic accounts: 


'Les pronoms demonstratifs designent un etre ou une chose en les situant dans 
lespace, eventuellement avec ungeste a I’appui (fonction deictique). Ils peuvent 
aussi renvoyer a un terme qui precede (fonction anaphorique) ou qui suit (fonction 
cataphorique) dans le contexte.’ 

‘Demonstrative pronouns designate a person or a thing by situating them in space, 
sometimes with an accompanying gesture (deictic function); they can also refer to a 
preceding term (anaphoric function) or a following term (cataphoric function) in the 
context’. 

In this paper, I argue that the French demonstrative pronoun f a (which will be 
alternatively translated by ‘this’, ‘that’, or ‘it’) does not merely point to an established 
referent, but creates that referent from the surrounding discourse. In this view, the 
pronouns presence instructs the hearer to organize the context of the utterance in a 
highly specific fashion. In order to argue this position, I examined 464 instances of f a 
from a corpus of newspaper (Le Monde) articles published between 1988 and 1990. The 
analysis proposed here is not intended to be exhaustive of all the possible senses of 
the pronoun, but merely indicative of a methodology to approach its investigation. 

1. a first attempt at the meaning of $a. A strictly referential analysis of fa would 
consider the pronoun’s discourse function as a direct extension of its deictic value. It 
is untenable because in numerous instances, f a doesn’t anaphorically refer to an exist¬ 
ing entity. The first argument against a referential account is provided by the contrasts 
that exist between true anaphors (personal pronouns) and f a in related situations. 
Contrasts such as the ones illustrated in (1) and (2) are well studied in the literature, 
most notably by Cadiot (1988) and Carlier (1996); the example in (1) is adapted from 
Carlier 1996: 

(1) a. Les gosses, ils se levent tot b. Les gosses, fa se leve tot 
The kids, they get up early Kids, they get up early 
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(2) a. Les Frangais, ils se plaignent tout le temps 

The French, they are always complaining 
b. Les Frangais, ga se plaint tout le temps 
French people, they complain all the time 

In (i)a and (2)a, the personal pronoun ils ‘they’ is strictly anaphoric of its referent 
because i) the agreement on the verb matches up with the gender and number of 
the referent, and ii) the preferred reading of the utterance is specific. For example, 
in the preferred reading of (i)a, the speaker has specific kids in mind, and it is about 
those kids that the predication of se lever tot ‘get up early’ is made. The examples in 
(i)b and (2)b illustrate what Cadiot (1988) calls decalages referentiels ‘referential shifts’. 
The subject/verb agreement does not match up with the referent’s number (it is in the 
singular), and the reading of the utterance is necessarily generic. This generic reading 
is confirmed in (3) by the restrictions that exist on the determiner allowed in the NP 
that denotes fa’s referent. (Carlier 1996). 

(3) a. Le/mon/les gosses du voisin, ils selevent tot 

The/my/the neighbors kids, they get up early 
b. *Le/mon/les gosses du voisin, ga se leve tot le matin 
The/my/the neighbors kids, that gets up early 

The second argument comes from the cases where the pronoun’s referent is not 
immediately identifiable, as illustrated in (4): 

(4) J’avais garde de bons copains du temps de I’Opera, dont un qui etait passe 
chez Cuevas. II m’a attrappe dans un bar de la rive gauche et m’a conseille la 
mode, m’a envoye presenter mes dessins. A lepoque, je m’habillais beaucoup, 
j’etais presque un personnage avec des avions, des nuages d’or et d’argent 
decoupes sur le dos de mes blousons. Je ne savais rien, j’ai prepare un dossier, 
ga a marche. J’ai appris comme ga, etje n’aipas arrete. 

‘I had kept good friends from the Opera days, including one who had gone to 
Cuevas. He caught me in a bar on the left bank and advised me to get into fashion, 
and sent me to show my drawings. At the time, I would dress up a lot, I was almost 
a character with planes, gold and silver clouds cut outs in the back of my jackets. 

I didn’t know anything, I prepared a portfolio, it worked out. I learned like that, 
and I never stopped.’ 

Intuitively, the referent of the first fa cannot be the nominal dossier ‘portfolio’, because 
a portfolio cannot be said to be working or not. The pronoun therefore doesn’t refer 
to a nominal directly available in the current discourse. If ga cannot be considered a 
direct anaphor to its referent, how can we characterize its meaning? Or, to rephrase 
the question in Cognitive Grammar terms (Langacker 1987a, 1991) *, what does fa pro¬ 
file? I would like to suggest that fa provides the hearer with the instructions to create 
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a specific region within the space provided by the context of the utterance. To put 
it differently, within the space of the context of the utterance, fa’s presence instructs 
the hearer to construe a group of entities as interconnected (Langacker 1987a, 1987b). 1 
The entities to be considered together are not always made specific, and the pronoun’s 
presence often requires the hearer to actively manipulate the current discourse in 
order to achieve the appropriate construal. In that view, fa’s presence enables the 
speaker and hearer to interactively sequence and group certain elements of the text 
for communicative purposes. 

2 . gA and the notion of created region. This hypothesis accounts at the same 
time for the pronoun’s deictic meaning as well as its discourse sense. 

2.1. deictic sense. Compare the example in (5)a to the one in (5)b. The context in (5) 
is that of a child expressing her preference in front of a pastry shop window display¬ 
ing cakes. 

(5) a. Je veux f a b. Je veux celui la 

I want that I want this one 

In both cases, the child needs to point toward the desired entity to get her point 
across, but assuming that her gesture is accurate, her utterance will produce the same 
result in both instances. However, the pronoun’s referent is accessed differently in the 
two situations. In (5)b,the typegdfeaucake’has either already been introduced, or it is 
implicit from the discourse situation, but it is shared between the child and her inter¬ 
locutor. The child is pointing to a particular instance of that established type as the 
object of her desires. In (5)a, there is no specific type shared between the interlocu¬ 
tors. The child is pointing to the cake as some portion of an undifferentiated mass. 
She is not accessing that mass as an example of any category, but in a sort of pre¬ 
categorized way, before the separation in the different types is made. £a therefore pro¬ 
files a region of undistinguished (pre-categorized) physical matter, which represents 
the definition of an object before it is characterized as a specific type. The specific 
nature of the object is recoverable by the immediate context, hence the importance of 
the accompanying gesture. 

2 . 2 . referential shifts. Consider the referential shift construction illustrated in ( 6 ): 

(6) La voiture, f a me plait 
Cars, I like them 

We have already seen that the nominal la voiture cannot be considered the ref¬ 
erent of f a per se. It is also obvious that fa’s referent is closely related to that nomi¬ 
nal. Consistent with the hypothesis proposed in the previous section, I claim that the 
pronoun’s referent is a more abstract nominal composed of a specific region created 
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around la voiture. In (6), the predicate plaire profiles the conceptualizer’s reaction to 
several possible kinds of activities associated with cars. It therefore seems reasonable 
to say that the mention of the nominal voiture evokes the set of activities associated 
with it. Activity is understood in its broadest sense that includes events (and possibly 
full propositions) where the nominal is a participant. (ja profiles the abstract region 
defined by construing these activities as a set of interconnected entities. 

Note that in these examples, there is no restriction on the possible nature of the 
extracted nominal. Definites and proper nouns are perfectly felicitous, as shown in (7) 
and (8). 

(7) Monfrere, ga m’interesse 
My brother, that interests me 

(8) Tenez, sans aller si loin, il etait pas bien mon Jacquot a I’Hotel de Ville ? Non, 
Matignon, ga lafoutait mieux, c’etaitplus classe question adresse. 

There we go, no need to look further. City Hall wasn’t good enough for Jack baby? 
No, Matignon, that sounded better, as an address, that had more class. 

The example in (7) can only be fully understood in context. (ja profiles the abstract 
region of activities, behaviors, beliefs etc. that surround the speaker’s brother, in rela¬ 
tion to which the predicate can be asserted. In (8), the proper name Matignon refers 
to the French prime minister’s residence, and thus metonymically stands for his func¬ 
tion, with all the considerations of money, power, prestige that it conveys, Cja’s pres¬ 
ence provides directions to construe these considerations as a set of interconnected 
entities (i.e. an abstract region). 

Let us now turn to the constructions briefly introduced in (1) and (2). Additional 
examples are presented in (9) and (10). 

(9) Ce que vous pouvez etre susceptibles, les enfants, c’est pas croyable. Tant que 
j’egratigne le voisin, ga va, vous m’envoyez desgerbes defleurs: Ce que t’es 
rigolote ! T’es vraiment la reine. Genial, ton billet, I’autrejour, sur lespaysans. 
On Va lu enfamille, on se marrait comme des baleines. Mais quand c’est vous 
queje taquine, ga vapas,je regois des giclees de bile : Ce que t’es conne ! 

T’es vraiment la reine. Nulle, ta crotte, I’autre jour, sur les agriculteurs. Un 
copain me la lue au telephone, j’en croyaispas mes oreilles. Les pires, c’est les 
Corses, lesjuifs, les Arabes, les chasseurs et les enseignants. C’est ombrageux, 
Venseignant, on n’apas idee, (ja sourcille, ga se herisse, ga se vexepour un 
rien. La, en ce moment, je les ai tous sur le paletot. 

‘How succeptible can you be, guys. I can’t believe it. As long as I scratch your 
neighbor, everything is fine, you send me flowers: How funny you are! You are really 
the queen. Superb, your note the other day about peasants. The whole family read 
it and we couldn’t stop cracking up. But when it’s you I tease, we have a problem. 

You cover me with bile: What an idiot you are! You are really the queen. Zero, 
your piece of trash about farmers the other day. A friend read it to me over the 
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phone, I couldn’t believe my ears. The worst are the Corsicans, the Jews, the Arabs, 
the hunters and the teachers. That broods, a teacher, one wouldn’t believe. That 
frowns, that gets all prickly, that gets offended for nothing. Right now, I have them 
all against me.’ 

(to) Etre ministre aujourd’hui c’est d’abord, semble-t-il, apprendre dfaire taire 

I’homme de gauche qui sommeille en soi. Un ministre, ga ouvre sa gueule a la 
rigueur (drole d’expression) pour parler de “justice sociale ”, de “ partage ”, de 
“ solidarity ”et de “pacte de croissance ”, mais si ga ne veut pas demissionner, 
ga doit la boucler devant des salaries agitant leurfiche depaie sur laquelle les 
grands mots lances depuis huit ans n’ont pas fait fleurir beaucoup de zeros. 

‘To be a secretary of state nowadays is primarily, it seems, to learn to gag the left 
wing person inside of us. A secretary of state, that opens its mouth if necessary to 
talk about “social justice”, about “sharing”, about “solidarity”, and “growth plan”, but 
if that doesn’t want to resign, that must shut up in front of workers who wave their 
pay stubs on which the rhetoric of the past eight years hasn’t added many zeros.’ 

The examples in (9) and (10) are similar to the ones presented in (6)—(8) with 
the exception that the predications made about the relevant nominals represent defi¬ 
nitional properties or attributes of those nominals. For example, the category ensei- 
gnant in (9) includes the behavioral traits described in the following predications. I 
propose that in these constructions, ga profiles a specific region created around the 
evoked category. This region is composed of the sum of the elements that provide 
the conceptualizer with her knowledge of the category, that is to say the facts, opinions, 
or other beliefs that arise in the conceptualizer when it is evoked. This knowledge of 
the category enables the speaker to pronounce the judgement expressed in the follow¬ 
ing predicates. To continue with (9), the category enseignants evokes the facts, beliefs, 
or even sensations or memories about its members that constitute the speaker’s knowl¬ 
edge of the category. This knowledge is expressed by the predicates etre ombrageux 
‘to brood’, sourciller to frown, se herisser ‘to get prickly’, se vexer pour un rien ‘to get 
offended for nothing’. 2 

2.3. $a and reality. In (11), the political figure Alain Poher was famous for maintain¬ 
ing a high profile in the senate for a very long time. 

(11) Poher, ga suffit! Comment ils osent ? (ja fait jamais que vingt 
ans qu’ily trone, au palais du Luxembourg, lepauvre cheri. 

‘Poher, that’s enough! How dare they? It’s only been twenty years he’s been reigning 
at the Luxemburg palace the poor dear.’ 

The exact nature of the abstract region that surrounds the nominal Poher is dif¬ 
ficult to characterize precisely. One possible solution would be to define it as the sum 
of the senator’s accomplishments. However, it doesn’t seem convincing in this con- 
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text, because political successes and failures do not constitute the topic of discus¬ 
sion. Rather, what is being commented on is the amount of time during which Poher 
has been part of the political scene. Because the passing of time is so crucial to the 
understanding of the utterance, I suggest that the abstract region anchored around 
the extraposed nominal represents the specific portion of evolutionary momentum 
(Langacker 1991) that prominently contains Poher. The second occurrence of fa pro¬ 
vides additional details by indicating the exact duration of that sub-part of reality. 3 

Instead of a person, the section of reality f a evokes can be anchored around a spe¬ 
cific situation as in (12). 

(12) Etpuis, un soir de janvier 1989, le telephone a sonne. C’est Daniel qui a 
decroche, la-baspres du canape rose. “ Manuela, quoi Manuela ? - Elle vivait 
avec unjeune homme. (ja s’est mal passe entre eux. II I’a tuee. Venez vite. 

On vous attend. ” 

And then, one evening of January 1989, the telephone rang. Daniel picked up, over 
there by the pink sofa. ‘Manuela, what about her?—She was living with a young 
man. It went sour between them. He killed her. Come quickly. We are waiting for 
you.’ 

In (12), f a evokes the sub-part of reality that encompasses the common life of Man¬ 
uela and her boyfriend. Its contents is available to the hearer as part of her world 
knowledge of what the life of a young couple might entail. (ja instructs the hearer to 
connect together the different facets of what she knows about life as a couple to create 
a nominal construal of the young lovers’experience. The ensuing event or proposition 
can then be predicated of that created nominal. 

In other cases, f a evokes the sub-part of reality that includes a particular event or 
proposition, as illustrated in (13) and (14). 

(13) Nous vendons en moyenne quinze mille exemplaires d’un disque. Passion, 
d’Arvo Part, en est a trente mille. (ja s’est fait defafon non premeditee. Je ne 
reflechis jamais en terme de rendement ou de budget. 

On average, we sell fifteen thousand copies of an album. Passion by Arvo Part has 
sold thirty thousand so far. It happened unexpectedly. I never think in terms of 
revenue or budget. 

(14) Ilfaut dire que le maire de Beaucaire est lui-meme instituteur depuis vingt- 
huit ans, mari et pere d’institutrices. C’est en partie pour des raisons “educa- 
tives” qu’il justifie les mesures prises a I’encontre d’enfants immigres. “J’ai 
dans ma classe quatre enfants maghrebins sur vingt-huit. (ja ne pose aucun 
probleme. Mais, quand on arrive, comme c’est bien souvent le cas a Beaucaire, 
a plus de so %, fa devient impossible”. 

One must say that the mayor of Beaucaire has been a school teacher himself for 
twenty eight years. He is also the husband and father of school teachers. It is partly 
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for ‘educational’ reasons that he justifies the measures taken against immigrant 
children.‘In my class, I have 4 children from the Maghreb out of 28 students. That 
doesn’t pose any problems. But when you reach a total of over 50% as it is often the 
case in Beaucaire, that becomes impossible.’ 

In (13), f a represents the segment of evolutionary momentum that brought about 
the proposition Passion, d’Arvo Part, en est a trente mille ‘Passion by Arvo Part has 
sold thirty thousand (copies) so far’ as part of present reality. In (14), the communica¬ 
tive purpose of the paragraph is to contrast two kinds of realities differentiated by 
the propositions they contain. The first f a evokes the sub-part of reality that includes 
the proposition J’ai dans ma classe 4 enfants Maghrebins sur 28 ‘In my class, I have 
4 children from the Maghreb out of 28 students’. The second fa evokes the sub-part 
of reality that includes a different proposition, namely quand on arrive a plus de so% 
‘when you reach a total of over 50%’. 

Finally, the scope of the sub-part of reality fa evokes can be wide enough to coin¬ 
cide with present reality, as shown in (15)—(17). 

(15) Une femme est debout dans I’entree, blonde, les cheveux sur les epaules. Une 
apparition. “Tu me reconnais ? ”, demande-t-elle. “Oui, repond Suzy. Mais 
comment f a sefait que t’es la ?” 

A woman is standing in the hallway, blond, her hair shoulder length. An apparition. 
‘Do you recognize me?’ she asks.‘Yes, Suzy answers. But how come you are here?’ 

(16) Dans dix jours, Mr Gorbatchevpresente devant le Comite central un plan de 
democratisation acceleree du parti et du pays. Plus f a va mal, plus il veut 
aller de I’avant, convaincu qu’il est maintenant que seule la radicalisation 

de la perestroika peut empecher son echec, puisqu’il ne faut pas freiner mais 
accelerer dans les virages. 

In ten days, M. Gorbachev will present a plan of accelerated democratization for 
the party and the country in front of the Central Committee. The worst it goes, 
the more he wants to go forward, convinced as he is that only the radicalization 
of Perestroika can prevent its failure, since one should not break but accelerate in 
the middle of a turn. 

(17) Olivier a done ete appele a la barre. Pere dun enfant depuis un mois, c’est 
pourtant encore un adolescent qui a eu dix-neuf ans en septembre dernier. II 
est si intimide qu’il murmure ses reponses au president Jean-Franqois Perie: 

“ J’avais desproblemes, jepreferais mourir... ”, balbutie lejeune homme. 

“ Et maintenant, f a va ? ”s’inquiete le magistral Apres un silence, Olivier 
soupire : “ va... ” 

So Olivier was called to testify. The father of a child for a month, he is, however 
still a teenager who turned nineteen last September. He is so intimidated that he 
mumbles his answers to president Jean Francois Perie: ‘I had problems, I preferred 
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to die’ the young man whispers. ‘And now, is it OK?’ the judge asks. After a pause, 
Olivier sighs ‘It’s OK...’ 

In these three examples, f a evokes present reality, that is to say the sum of events 
and propositions that combined to produce the state of the world we are currently 
witnessing. For example in (15), Suzy is expressing surprise at the current state of 
events that include the presence of a person she didn’t anticipate to see. 

Specific aspects of reality can be made particularly salient in their construal by f a. 
For example, the pronouns referent frequently profiles the precise length of time real¬ 
ity has included a certain event or proposition. Such cases are presented in (18)-(20). 

(18) Mon Jules, quel Jules ? (Ja vafaire deux ans qu’on estplus ensemble. 

My man, what man? It will be two years we haven’t been together 

(19) Et puis, des qu’il a appris la nomination d’Antoine Vitez a la tete de la 
Comedie-Frangaise, cejeune comedien free lance s’est dit qu’il etait peut-etre 
temps de s’engager plus quepour un role : “ (Jafait huit ans quejefais ce 
metier etje ne m’etais jamais engage a long terme. C’est pour f a queje n’aipas 
fait le Conservatoire ou decole. 

And then, as soon as he learned Antoine Vitez’s was the new director of the 
Comedie Franfaise, this young free lance comedian said to himself that maybe it 
was time to commit himself for more than one role: ‘It’s been more than eight years 
that I’ve been in this business and I had never committed myself for the long run. 
That’s why I never went to the Conservatory, or any other school.’ 

(20) “Alors, f a y est, Edu, tu te remets a chanter ? “ C’est avec I’humour des 
gens de Rio, toujours prompts a raconter les pires histoires en se moquant 
d’eux-memes, comme s’ils etaient les acteurs d’une comedie de boulevard, 
qu’Edu Lobo evoque ses vingt-cinq ans de carriere : celle d’un compositeur qui 
voudrait bien n’avoir qua composer, mais est contraintpar la vie, les autres, 
ou lui-meme, a remonter sur scene. 

‘So, that’s it Edu, you are singing again?’ It is with the humor of the people of Rio, 
always ready to tell the worst stories and make fun of themselves, as if they were the 
actors in a Vaudeville, that Edu Lobo evokes his twenty five year career: the career 
of a composer who wished he could have remained only a composer, but who was 
forced by life, other people or himself to start performing again. 

2.4. cognitive manipulations. So far, the analysis has concentrated on the nature of 
the region f a creates out of the immediate context. This section examines the way in 
which fa’s presence signals the cognitive manipulation (summarization, abstraction, 
inferrencing) of information already available in the context or in the hearer’s world 
knowledge. In (21) and (22), the pronoun’s presence instructs the hearer to summarize 
the information at her disposal. 
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(21) Les affaires de Manuela ? Elies etaient sous scelles, intouchables. Les seules 
parcelles de realite, les seules preuves symboliques de la mort de Manuela, 
Maite Gourjault les a puisees alors dans ces rares images : un policier dans 
un souterrain, un avocat a qui elle donne un cheque, deux rendez-vous chez 
lejuge d’instruction, Yincineration d’une caisse de bois anonyme et quelques 
papiers quelle apu feuilleter. Etpuis ilyeut les lettres dujeune homme ecrites 
depuis sa prison. “ Bonjour Maite, dis-moi comment tu vois tout f a vu de 
Yexterieur. Pour ma part, tout reste inexplicable.” 

Manuelas stuff? It was sealed, untouchable. The only fragments of reality, the only 
symbolic proofs of Manuelas death, Maite Gourjault dug them out of these rare 
images: a policeman in a tunnel, a lawyer to whom she gave a check, two appoint¬ 
ments with the judge, the cremation of an anonymous wooden box and a few 
papers she could flip through. And then there are the young mans letters from his 
jail cell. ‘Hello Maite tell me how you see all that from the outside. For my part, 
everything remains inexplicable’. 

(22) On plonge dans Yellow Submarine avec unefacilite qui, au debut, deconcerte. 
Le delire visuel coordonnepar George Dunning reprend tous les cliches de la 
periode de I’op’art, qui fait mal a la tete, au style nouille passe a la peinture 
orange. Mais, sous la surcharge, le charme agitparce que, en accumulant 
une douzaine de trouvailles a chaque plan, onfinit par tomber juste assez 
souvent. Et, de collages en mises en abime, de labyrinthes en solarisation, on 
finit par retrouver la musique, univers poetique du groupe. Tout fa grace a 
une bonne volonte sans arriere-pensee qui fait de cette oeuvre de commande 
un instantane parfait de Yapogee des Beatles. 

One dives into Yellow Submarine with such ease that it is disconcerting at the 
beginning. The visual delirium coordinated by George Dunning makes use of all 
the cliches from the op’art period that gives you a headache to the noodles-covered- 
in-orange-paint style. But underneath the overload, the charm is effective because 
when one accumulates a dozen findings per plan, one ends up being right quite 
often. And from collage to zoom, from labyrinth to solarization, one ends up finding 
the music, the poetic universe of the group. All that thanks to a good will without 
second thoughts that makes that work on command a perfect snapshot of the peak 
years of the Beatles. 

The example in (21) comes from the same article as the one in (12 ). Manuela, Mai¬ 
te s daughter has been killed by her boyfriend (the jeune homme). It is quite difficult to 
precisely identify the entity that f a evokes, but it clearly involves the summary of the 
circumstances leading to (and including) the current situation. It is interesting to note 
that this entity is not shared by the participants in the discourse. The circumstances 
that surround Manuelas death are necessarily different (at least in part) for her mur¬ 
derer and her mother. This is, however, unproblematic for the analysis developed 
here where fa provides the hearer with the instructions to create a summary of the 
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circumstances available to her, whatever those might be. The abstract region profiled 
by f a is composed of the mother’s views, beliefs, feelings, and opinions about her 
daughter’s death connected together. In (22), the information to be summarized is 
specifically provided (at least partly) in the immediate context. (ja profiles the sum¬ 
mary of the experiences accumulated throughout the show. 

The presence of f a often involves other conceptual manipulations in addition to 
summarizing. Some of them are illustrated in (23) and (24). 

(23) Vous avez vu Radio days, de Woody Allen, cetait exactement cela ! Je nepeux 
pas dire mieux! Cette maison etait la mienne, la femme qui ne se marie pas, 
cetait ma tante ! La rue qui donne sur la mer, les voisins, Vecole, les parents, 
le football, le base-ball, tout f a etait ma vie ! “ 

You saw Radio days by Woody Allen, it was exactly that! I can’t say it any better! 

That house was mine, the woman who doesn’t marry, it was my aunt! The street 
that overlooks the sea, the neighbors, the parents, football, baseball, all that was 
my life! 

(24) Un an et demi apres nous devons constater quepour nous, la vie quotidienne 
ne change pas. En banlieue de Noumea, a Saint-Louis par exemple, six cents 
families wallisiennes vivent encore sans eau courante et sans electricite et, 
ily a quelques semaines, plus pres de Noumea, dans la vallee du Tir, nous 
avons du manifester parce qu’on avait coupe Veau a tout un quartier pour 
sanctionner sept mauvais payeurs. C’est tous les jours comme f a et, si Von 
n’y remediepas, cela signifie que les accords de Matignon ne sont qu’une 
mascarade. 

One and a half years later, we must say that for us daily life doesn’t change. In the 
suburbs of Noumea, in St Louis for example, six hundred Wallisian families still 
live without running water or electricity, and a few weeks ago, closer to Noumea, in 
the Tir valley, we had to demonstrate because the water had been turned off for a 
whole neighborhood to penalize seven people who hadn’t paid their bills. Everyday 
it’s like that and, if nothing is done, it means that the Matignon agreements are 
nothing but a farce. 

In (23), the author is attempting to describe his life growing up in the USA. His 
communicative strategy is to compare it to the kind of life described in the movie 
Radio Days by Woody Allen. This is accomplished in the first sentence by a parallel 
structure that establishes the entity evoked by the first c (a reduced form of f a) as 
similar to that of cela (related to fa). The entity c evokes is the summary of the author’s 
life. At this point, it is fairly abstract to the hearer. The entity cela evokes is the sum¬ 
mary of the events and situations presented in the film. It is readily available to the 
hearer, assuming she has seen the movie. The predicate that links the two entities 
allows the hearer to make the referent of c’more concrete by filling it in with elements 
of the movie. Later, the author makes his point more specific by establishing similar 
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correspondences between specific elements. The entity the final fa evokes recapitu¬ 
lates and summarizes these elements. 

In (24), the author is complaining about the poor treatment of the native popula¬ 
tion in New Caledonia by the French authorities following the Matignon agreements 
with France. The two examples provided are meant to be symptomatic of a general 
atmosphere. The entity fa evokes needs to be understood in that way. It cannot be said 
to summarize the two examples of bad treatment inflicted on the locals by the French 
authorities, but it generalizes them to all other possible instances of such behavior. 
The region profiled by fa is therefore quite abstract because it involves all the events 
that may conform to the pattern established by the preceding discourse. 

In other cases, the entity evoked by fa is only available by an inference created by 
the presence of specific elements in the immediate context. This is illustrated in (25). 

(25) A Hambourg, mercredi dernier, la ville oil les Beatles ontfait leur classes, 

McCartney a quand meme reussi a conjurer par moments ce malaise etrange. 
On ne pourra jamais s’empecher d’etre injuste avec McCartney, de lui repro- 
cher de ne plus etre ce qu’il a ete. Et c’est vrai qu’il est un peu mesquin dans 
le choix des morceaux: pas une chanson ne porte lapatte de John Lennon, 
fa n’aurait pourtant pas manque delegance, mais c’est peut-etre la qualite qui 
lui fait leplus defaut, comme en temoigne le gout douteux des rares effets 
sceniques. 

In Hamburg, last Wednesday, the city where the Beatles played at their beginnings, 
McCartney still succeeded at times to overcome this strange discomfort. One wont 
ever be able to stop being unfair to McCarney, to stop blaming him for no longer 
being what he was. And it is true he was a little classless in his choice of pieces: not 
a single song with John Lennon’s signature, even though that wouldn’t have lacked 
elegance, but it is perhaps the quality he lacks the most, as attested by the doubtful 
taste of the rare scenic effects. 

In (25), the pronoun’s referent isn’t directly available in the immediate discourse. 
The entity fa evokes is a nominal created by inference out of the preceding negative 
statement. It can roughly be characterized as une chanson qui porte la patte de John 
Lennon ‘a song with John Lennon’s signature’. 

2.5. Subjective construal. One of the most important aspects of the meaning of 
fa is the subjective construal it imposes on the entity it evokes. The notion of sub¬ 
jectivity is used here in the sense of Langacker (1985,1990). It refers to the vantage 
point from which a linguistic expression is conceptualized, as well as the viewing 
organization that exists between the conceptualizer and her conceptualization. The 
analysis of an expression’s viewing organization involves the investigation of the con¬ 
ceptual asymmetry in the construal relation between the conceptualizing subject 
and the object of conceptualization. Different configurations of viewing arrange¬ 
ment have been shown to have important linguistic correlates (Langacker 1985,1990, 
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Achard 1998). The subjective construal f a imposes on the entity it evokes is illustrated 
in (26) and (27). 

(26) Les Archaos ont investi le Cirque d’Hiver, et fa fait du vacarme. Ils se 
deguisent en punks desinvoltes, pratiquent la derision-deglingue avec une 
fraicheur revigorante, en bons enfants des Monty Python et du Magic Circus, 
cependant respectueux des lois du cirque. Simplement, ils les habillent a leur 
maniere. 

The Archaos have invaded the Cirque d’Hiver, and that makes a ruckus. They 
dress up as carefree punks, poke fun of everything with invigorating freshness, the 
rightful heirs to Monty Python and Magic Circus who nonetheless respect the rules 
of the circus. Simply, they dress them up in their own style. 

(27) Clair de terre nest pas une piece rose, (fa se passe dans nos campagnes. 

Clair de terre is not a light play: That takes place in our countryside’ 

The example in (26) is reminiscent of the ones in (13) and (14) in that the entity 
f a evokes represents the sub-part of reality that includes the presence of the specific 
proposition Les Archaos ont investi le Cirque d’Hiver. However, it differs from them 
in two major ways. Firstly, the perception of noise is presented as a most salient char¬ 
acteristic of reality to the point that it metonymically stands for reality. Secondly, the 
entity f a evokes is construed subjectively. 

In order to understand the expressions subjective character, consider a possible 
alternative to (26), namely Les Archaos ont investi le Cirque d’Hiver, et ils font du 
vacarme' the Archaos have invaded the Cirque d’Hiver, and they are making a ruckus’. 
In that alternative, the verb profiles the speaker’s conceptualization of the relation 
between the source of the noise (ils) and their creation (le vacarme ). The source of 
the process, as well as the main relation are well delineated and profiled. They are 
both treated as mere objects of conceptualization, while the speaker remains off stage. 
The subjects and objects of conceptualization are therefore maximally differentiated, 
but the situation is quite different in (26). The subject fa doesn’t represent the source 
of the noise, but the abstract location (sub-section of reality) where it is taking place. 
Importantly, that abstract location also includes the speaker. The source of the noise 
is more diffuse and also implicitly contained in that location. The verb faire doesn’t 
profile the creation of the source of the noise, but the sensation that fills the scene. 
The presence of fa therefore blurs the difference between the object of conceptualiza¬ 
tion (the source and production of the noise) and the conceptualizing subject (the 
speaker) by making them both part of the profiled scene. 4 

The situation is similar in (27). The application of the same kind of analysis yields 
the impression that the play is viewed from within, that is to say by someone actu¬ 
ally watching the show. This characteristic of f a is often used to present a situation 
in a more lively manner, as if the conceptualizer were actually experiencing it. I will 
simply give one additional example in (28). 
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(28) Vous avez vu un peu ce qui se passe dans ce pays ? C’est lafolie ! (fa proteste, 
f a rouscaille, fa rouspete dans tons les coins. 

Have you seen what’s going on in the country! It’s crazy! That protests, that com¬ 
plains, that bitches in every corner. 

The source of social unrest in (28) is clearly people in general. A possible alterna¬ 
tive could easily be les gens protestent, rouscaillent, rouspetent ‘people protest, com¬ 
plain, bitch’. However, that source is subjectively construed, implicitly situated within 
the abstract setting profiled by the subject f a. The main verbs do not portray clearly 
defined actions with clearly identifiable sources, but rather the more global atmo¬ 
sphere that results from the subjective construal of the conceptualized scene. 

3. Conclusion. This paper argued that the French demonstrative pronoun fa pro¬ 
vides the hearer with the instructions to create a region within the space provided 
by the specific context of the utterance, and that the pronoun profiles that region. 
It also showed that the precise nature of the region heavily depends on the context, 
and that it also often involves the conceptual manipulation of information available 
in the discourse. Finally, it argued that the pronoun imposes a subjective construal on 
the entity it evokes. 


In Cognitive Grammar, the schematic value of a noun is to designate a region in some 
domain, a region being defined as a set of interconnected entities. Consequently, in this 
paper, fa is treated as a schematic noun. 

The generic character of the construction comes from the fact the category is reduced to 
the undifferentiated mass of the facts, beliefs, emotions it triggers in the conceptualizer. 
This mass construal also accounts for the fact that the construction is often considered 
derogatory when it describes humans. This construction, however, is very complex, and 
the analysis presented in this paper should only be considered exploratory. 

The notion of evolutionary momentum represents the dynamic aspect of reality in Lan- 
gacker’s Dynamic Evolutionary Model (Langacker 1991) presented in Figure 1. 



Figure 1. Dynamic evolutionary model (from Langacker 1991:277). 

One might argue that the subjective construal it imposes on the entity it evokes is a 
direct consequence of the pronoun’s basic deictic character, but arguing in favor of this 
hypothesis would take us beyond the scope of this paper. 


4 
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WHAT'S IN THE HEAD? METAPHORICAL 
EXPRESSIONS IN POLISH AND ENGLISH 


Barbara Bacz 
Laval University 


Traditionally, metaphorical expressions have been regarded as figures of speech, i.e., as 
stylistic ornaments employed by writers or orators to enhance verbal performance 1 . 
Metaphor was a vital characteristic of what has been called‘poetic (or‘metaphorical’) 
language, i.e., a language containing unusual (Lakoff uses the adjective ‘novel’) expres¬ 
sions in which one or more words for a concept were used outside of their normal 
conventional meaning to express a concept that was somehow similar. It was widely 
assumed that everyday language was not metaphorical. In consequence of this view, 
the study of metaphor as a phenomenon was confined to the realm of literary scholar¬ 
ship with the rare exception of language-oriented work produced by philologists who 
professed an interest in stylistics. 

The novelty of the Lakoff-style cognitivist approach to the phenomenon of met¬ 
aphor (see the work of Lakoff and collaborators 1980-98) lies in the fact that the 
so-called ‘metaphorical’ uses of language ceased to be viewed as figurative embel¬ 
lishments of rhetorical style, but came to be considered (and studied) as a surface 
manifestation of the way in which speakers think; more precisely, of how they con¬ 
ceptualize abstract categories making use of their knowledge of the categories that are 
familiar to them from everyday experience. 

One implication of the cognitivist view of metaphor for the general theory of lan¬ 
guage is, from the perspective of an ordinary linguist, that the place of this linguistic 
phenomenon has been shifted from the surface level of performance to the underly¬ 
ing level of competence. Another consequence, this time for the study of meaning, 
is that conceptual metaphor has come to be regarded as a phenomenon central to 
the semantics of everyday language, whereas traditional, poetic metaphor has been 
relegated to the peripheral position of conceptual metaphor’s extension. 

Metaphor viewed as a cognitive instrument is defined as a process of‘conceptual¬ 
izing one mental domain in terms of another,’ a mental operation which cognitive 
linguists call a cross-domain mapping (Lakoff 1993). The term ‘domain used in this 
definition should be interpreted as referring to a body of knowledge ‘experienced and 
stored for a certain field by individuals and social groups or cultures’ (Ungerer & 
Schmid 1996:120). Metaphorical mapping implies transfer from the source domain 
to the target domain. What gets transferred in the process are not just the properties 
of the source category but the entire structure of internal relationships characterizing 
the domain of the source category. Internal relationships of the source category that 
are exploited in a given metaphor are called structural mappings. They capture basic 
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structural analogies (or correspondences) between the source and the target catego¬ 
ries and are manifested in language by sets of specific metaphorical expressions, e.g., 
in the metaphor +love is a journey+, proposed and discussed by Lakoff (1992:4), 
one such analogy would be: the lovers correspond to travellers, and it would 
be manifested by such everyday expressions as Look how far we’ve come; It’s been a 
long road, etc. Lakoff (ibid.) insists that the principle underlying structural mapping 
is neither part of the grammar nor of the lexicon, but that it belongs to the concep¬ 
tual system underlying the language in question (here English) which permits under¬ 
standing of one domain of experience, (here love), in terms of a very different domain 
of experience, (here journey). One of the major objectives of Lakoff’s research since 
1990’s has been to verify the hypothesis accounting for the principle underlying 
cross-domain mapping, which he calls the Invariance Principle, and which states 
the following: ‘Metaphorical mappings preserve the cognitive topology (that is, the 
image-schema structure) of the source domain, in a way consistent with the inherent 
structure of the target domain (Lakoff 1992:11). 

The objective of the present analysis of Polish metaphorical expressions making 
reference to the body-part head is to try to account for a small portion of these idioms 
using the cognitivist concept of metaphor as a tool of analysis. Polish metaphorical 
expressions related to the notion of gtowa ‘head’ will be compared with English meta¬ 
phorical uses exploiting the same body-part concept where the comparison appears 
significant in order to establish which aspects (structures) of the source category have 
produced similar expressions in the two languages and which ones have led to differ¬ 
ent metaphorical realizations. 

Two large groups of examples relating to the concept of body-part head in the data 
are: (a) set lexical expressions with the word head/glowa applied to domains other 
than the domain of the body-part, and (b) idiomatic expressions in the form of sen¬ 
tences from everyday speech which make use of the notion of the body-part head 
to represent the speakers understanding of the abstract concept of the human mind 
and the activity of thinking. In the following analysis, I will discuss these two types of 
metaphorical examples using the cognitivist concepts of lexical and conceptual meta¬ 
phor, respectively. 

Other metaphorical expressions, which exploit the concept of the body-part head 
to refer to life ( Przyplacil to glowq / It cost him his head), talent or predisposi¬ 
tion ( Ma glowq do interesow / He has a head for business), or to denote a measure of 
length or quality ( Przerasta mnieo glowq / He’s head and shoulders above me), etc. will 
not be considered. Also, when talking about metaphorical expressions, I am using the 
adjective ‘metaphorical’ in its broad sense meaning ‘figurative’, i.e., non-literal. Thus, 
the corpus of metaphorical expressions considered in this paper includes all kinds of 
figurative uses independent of their subsequent classification. 

1. lexicalized (or‘dead’) head-to-plants metaphors. The cognitivist conception 
of metaphor as a conceptual instrument differs from the traditional perception of the 
phenomenon, among others, in the attitude towards linguistic units that have been 
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known as ‘dead’ metaphors. The term denotes originally metaphorical expressions, 
such as head of state/ gtowa pahstwa, which have become so conventionalized that 
their metaphorical character is no longer noticed. These expressions have become 
lexicalized as independent dictionary entries and have thus acquired a literal mean¬ 
ing of their own. From a cognitive perspective, such lexicalized metaphors are not 
only not ‘dead’, but extremely important since they are the best examples of uncon¬ 
scious, and therefore most efficient metaphorical processes operating in language. 
(See Takoff & Turner 1989:129). A typical Polish-English example of such lexicalized 
metaphor is given in (1): 

(1) glowa/ glowka [diminutive of glowa] kapusty 

‘a head of cabbage 

The conceptual metaphor which resulted in the expression head of cabbage/ glowa 
kapusty consists in the transfer of important properties and cognitive structures 
inherent in the source category body-part head to the target category cabbage. The 
specific features and structures which underlie the metaphor, i.e., which are respon¬ 
sible for the transfer, are captured through the following structural correspondences: 

• The shape of a head of cabbage corresponds to the shape of the human 
head. 

• The position ‘up (above ground) of a head of cabbage corresponds to the 
position ‘up’ (with respect to the rest of the body) of the human head. 

The first of the two structural correspondences distinguished for (1) is obvious: both 
the human head and the head of cabbage are more or less round. This feature, how¬ 
ever, would not be sufficient to justify the presence of metaphorical mapping from the 
domain body-part head into the domains of larger superordinate categories, such 
as vegetables or even plants, a transfer which is apparently required by the Polish 
and English expressions containing the word glowa/ head such as glowka salaty! a 
head of lettuce or glowa slonecznika/ a sunflower head. Beets, potatoes and radishes 
are also round, but their shape has obviously not evoked the association needed for a 
metaphorical conceptualization based on the notion of head in either Polish or Eng¬ 
lish. (Polish uses the image of a bulb ‘bulwa’ as the source category for potatoes, and 
English has tuber, both terms being rather technical, in spite of the fact that in both 
languages potatoes do have eyes oczka [a diminutive form of oko eye’]). It seems that 
in the case of vegetables, size might be an important factor. Cabbage and lettuce may 
be about the size of a human head while potatoes, beets and radishes are normally 
smaller 2 . To me, however, a generalization that would apply to members of a domain 
larger than the domain of vegetables, i.e., the domain of plants, appears more con¬ 
vincing. It could be based on the observation that for the HEAD-metaphor to occur, 
the plant’s top parts have to be seen (as the plant is growing). That’s why I suggest that 
the second structural feature of the source category, the feature concerning the posi- 
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tion ‘up’ should be considered as the most significant element in the explanation of 
the metaphor. 

Both English and Polish exploit the concept of the body-part head metaphorically 
to refer to the top parts of certain flowers, such as sunflower, daisy or clover (glowka 
stokrotki, slonecznika / head of a daisy, a sunflower, as in Slonecznik zwiesil glowg ‘The 
sunflower has hung down its head’ or glowki koniczyny / clover heads), or generically, 
to the top parts of all flowers, as in Chlopczyk poobcinal glowki kwiatom w wazonie 
‘The boy has cut off the heads of all the flowers in the vase’. The lexical metaphor 
in this case is motivated not by the size of the flowers (sunflowers are big while dai¬ 
sies and clover are small) but by the fact that their heads are round and what’s more 
important, that they are placed ‘up’ on their stems (the latter feature additionally 
recalling the linear shape of the human body with the head on top) 3 . 

Discussing the use of body-part terms in expressing concepts of spatial orientation, 
Heine (1997:40-41) observes that 87% of all African and 61% of all Oceanic languages 
have grammaticalized the word for head to conceptualize the spatial orientation con¬ 
cept ‘up’. The importance of the head -to- orientation up metaphorical mapping in 
the languages of the world stems from the speakers’ experience of the anthropomor¬ 
phic model of the vertically-oriented human body with the head on top. 

Lexicalized metaphors are important for the understanding of polysemy since they 
represent what has been described as multiple categorization (See Ungerer & Schmid 
1996:118). The cognitive view of metaphor permits us to interrelate categories from 
different domains that are expressed by the same word, and to construct complex 
network models for lexical items. In a possible cognitivist model of category inter¬ 
relationships, the lexical item head will represent a subordinate category shared by 
two superordinate categories: body-part and plant, and will thus demonstrate the 
link between these two categories established through metaphor. Also in the cognitiv¬ 
ist framework, a semantic network model for the polysemous lexical item head (or 
glowa) will be constructed in such a way as to incorporate all domains which exploit 
the cognitive structures of the body-part prototype through the process of metaphor. 

In the work of Picoche (1994, 1995), who represents the Guillaumian theory of 
language, lexicalized metaphors are explained through the mechanism of demateri¬ 
alization. Analysing the literal and the figurative senses of the French verb marcher, 
in uses such as L’ homme marche - L’ascenseur marche, Picoche argues that the meta¬ 
phorical transfer within this lexical item results in the increased abstraction of its 
lexical meaning and the resultant lexical impoverishment. Explaining the metaphori¬ 
cal uses of the word glowa/ head in lexicalized expressions of the cabbage-head type 
as dematerialization might imply that the item in question is seen as being on its way 
to become grammaticalized (as do or will have grammaticalized in English from full 
lexical categories in I will do it; God has willed it so to grammatical categories in I do 
intend to go; I will go). The dematerialization hypothesis can perhaps be maintained 
if the adverbial or adjectival derivatives of the noun head/ glowa are brought in as 
evidence (such as headlong or He is ahead of me in English or dworzec glowny ‘head 
station - central station, or glownie ‘mainly’ in Polish). If the word in a lexicalized 
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metaphor does not change its part-of-speech status, however, the grammaticalization 
claim appears counterintuitive and premature, especially in Polish, whose grammati¬ 
cal system still makes productive use of diminutives or augmentatives to indicate, I 
would like to argue, that the word in question is used metaphorically I suggest that 
the word glowa is used in its diminutive form in the lexical entry glowka kapusty cab¬ 
bage head’ not to point to its small size or the speaker’s affectionate attitude, but to 
signal to the hearer that it should not be interpreted literally 4 . On the other hand, I 
find the use of the base form glowa in the expression glowa kapusty marked, for it 
clearly implies a reference to the cabbage head’s big size. 

2. +head is mind+ conceptual metaphor in polish. According to traditional 
rhetoric, metaphorical expressions considered in this paper represent two figures of 
speech: metaphor, as in (1) and metonymy, as in (2). Metaphor has been traditionally 
defined as a rhetorical device based on notions of similarity’ or ‘comparison’ between 
the literal and the figurative meaning of an expression, whereas metonymy has been 
limited to uses based on the principle of contiguity’ (i.e., nearness or neighbourhood) 
between what is denoted by the literal meaning of a word [in an expression] and 
its figurative counterpart (see Ungerer & Schmid 1996:115). The classical distinction 
between metaphor and metonymy is preserved in cognitivist theory even though in 
essence the two categories represent very similar processes. Both are conceptual in 
nature, both can be lexicalized, and both are based on the principle of source-to- 
target mapping. The difference between them is that whereas in metaphors the trans¬ 
fer occurs across different domains (from one domain to another), in metonymies, it 
takes place within the same domain. 

(2) a. Wypada po 3 dolary na glowf (= na osobf). 

It comes to 3 dollars per head (= per person), 
b. Kowalski to tfga glowa. 

‘Kowalski is a big head’. 

Kowalski is very bright; he is a brain. 

The relationship between the literal and the figurative meanings of glowa 'head’ in 
the Polish and English uses in (2) is typical of the classical pars pro toto (+ part for 
whole +) metonymy: in (2)a the word head, referring to a part of the human person 
stands for the whole person. The domain of the source and the target category is the 
same: the human person. In (2)b, Kowalski to tfga glowa, there is a combination of 
the + part for whole + metonymy (Kowalski’s head stands for Kowalski) and the 
+ head is intellect + metaphor (Kowalski is equated to his intellect, and the size 
of his intellect is alluded to by reference to the size of his head). The Polish expres¬ 
sion fgga glowa ‘a big head’ refers to superior intellect whereas in English big head sug¬ 
gests arrogance and conceit. Other types of metonymy distinguished in cognitivist 
literature are based on the contiguity relations listed in (3), after the table quoted by 
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Ungerer and Schmid (1996:116) on the model of the collections of metonymy types 
provided, among others, by Lakoff and Johnson (1980:38^: 

(3) Contiguity-of-referents relations in metonymy (Ungerer & Schmid 1996:116) 
+part for whole+ ( all hands on deck) 

+whole for part+ (to fill up the car) 

+ CONTAINER FOR CONTENT+ ( I’ll have U glass) 

+material for object+ (a glass, an iron) 

+producer for product+ (buy a Ford) 

+place for institution+ ( talks between Washington and Moscow) 

+place for event+ ( Watergate changed our politics) 

+controlled for CONTROLLER!- (the buses are on strike) 

+cause for EFFECT+ ( his native tongue is German) 

From the point of view of the distinction between metaphors and metonymies, 
the second, sentence-based group of the Polish and English metaphorical expres¬ 
sions involving the notion of gtowa/head would fall into the category of metonymy, 
mainly of the -(-container for content+ type. In both English and Polish, the 
body-part head is conceptualized as the container for the brain, and by subsequent 
metaphorical-metonymic extension +brain is the container of mental objects-)-, 
as the seat of thoughts, ideas, wisdom and intellect; in short, as a management centre 
of all mental activity. Examples U)a-e illustrate metaphorical mapping from the 
source domain head to the target domain mind achieved through the combination 
of the metonymy +the head stands for the brain+ and the metaphor +locus 
of the brain (head) is mental ACTIVITY-)-. To follow the analytical procedure 
proposed by the cognitivists, metaphorical transfer from the domain head to the 
domain mind can be described by a series of specific structural mappings. Five such 
mappings are suggested below for Polish with representative metaphorical expres¬ 
sions as their justification. 

(4) Structural mappings from the domain head to the domain mind in 

Polish: 

a. HEAD IS A CONTAINER FOR WISDOM, KNOWLEDGE AND THOUGHTS 

Nie miesci mi sig to w glowie. 

‘It overflows my head-, it’s too big for my head’ 

It [the idea] is utterly and completely incredible; it’s beyond my under¬ 
standing. 

On nie ma oleju w glowie. 

‘He has no oil in his head’. 

He has no brains. 
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Ladowat mu wiedzf (lopatq) do glowy. 

‘He loaded knowledge into his head with a shovel’. 

‘Kto nie ma w glowie, ten ma w nogach’ (a proverb) 

‘He who doesn’t have it in his head, he has it in his feet.’ 

He who doesn’t think, has to run. 

The container metaphor is one of the most important (Lakoff uses the term‘primary’) 
image-schema patterns used for conceptualizing abstract concepts. The head-as-con- 
tainer examples U)a constitute the largest group in the HEAD-to-MiND mapping in 
Polish (see Doroszewski 1980, Dunaj 1996). They have been classed together on the 
basis of their predicate verbs, such as miescic sif ‘squeeze in, fit’, ladowac ‘load’, miec 
w ‘have in’, which expressly evoke an image of a container, and because of the preva¬ 
lent use of the preposition w ‘in with the noun glowa, which suggests location within. 
Knowledge and wisdom contained in the head are pictured metaphorically as a solid 
mass material that can be loaded in (with a shovel, like coal), or as a liquid (oil) that 
can be poured in to introduce light (cf. Pomysl zaswital mi w glowie ‘The idea dawned 
on me’). Frequently, the contents of the container head is referred to in a general 
manner by the neuter (indefinite gender) pronoun to ‘it’. 

(4) b. HEAD IS A SEAT/ A LOCUS OF THOUGHTS, IDEAS AND WISDOM 

Skqd ci to przyszlo /wpadlo do glowy ? 

‘Where did it come/ fall into your head from?’ 

What put that [idea] into your head? 

Wylecialo mi to z glowy. 

'It flew out of my head’. 

It has gone clean out of my head; I completely forgot about it. 

Zachodzf w glow f, jak to sif stalo. 

‘I am going deep into my head to figure out how it happened’. 

I am thinking very hard to figure out how it happened. 

Musz fpojs po rozum do glowy. 

‘I have to go to the head to fetch some wisdom. 

I have to set my wits to work (to do what makes sense, to make a sensible 
decision) 

The head is also represented as ‘home’ for mental objects such as thoughts and ideas. 
These objects are not locked in or controlled by the owner of the place at which they 
reside, for they are represented as things that enter ( przyjsc 'come to’, wpasc fall/ drop 
in’) or leave (wyleciec z ‘fly out of’, wypasc z fall out of’) their residence. When one 
needs to use his wits (rozum), the head is the place he can fetch them ( isc po) from 5 . 



70 


BARBARA BACZ 


(4) C. HEAD IS THE BEARER OF ONE’S PREOCCUPATIONS 

Mam dzisiaj duzo spraw na glowie. 

‘I have many things on my head today.’ 

I have a lot of things to do today. 

Goscie siedzq mi na glowie juz dwa tygodnie 

‘Guests have been sitting on my head for the last two weeks.’ 

I have been preoccupied with my guests for the last two weeks; it’s a bit 
too much. 

Nareszcie mam ten egzamin zglowy 
‘At last I have that exam off my head’ 

I have finally got rid of that exam; I don’t have to worry about it 

Polish idiomatic uses illustrated by the sentences in (4)0 are based on the structural 
characteristic of the body-part head which is not exploited in the HEAD-related meta¬ 
phorical expressions in English. The head in these idioms is conceptualized as that part 
of a person which bears all of one’s preoccupations, which are viewed with a somewhat 
negative attitude (present and future problems, worries and all things one has to take 
care of). The basic linguistic proof of that interpretation derives from the semantics of 
the Polish locative preposition na ‘on. The impression produced by the expressions miec 
cos na glowie/ miec cos z glowy ‘to have something on one’s head / to have something 
off one’s head’ is that whatever is placed on the head (and usually these are concrete 
things denoted by concrete nouns such as exam, guests, wedding, etc.) is perceived as 
a burden one would gladly get rid of. The closest English equivalent of that mapping 
would be found in the idiom to get something out of one’s hair, as in Get it out of my hair 
meaning ‘You take care of if which draws on the inherent structure of another body- 
part category, the hair. It seems that in English the hair, and not the head, is perceived 
as the locus of the owner’s burdensome (and therefore negative) preoccupations 6 . 

Other metaphorical expressions in Polish and English which exploit the domain 
head in one language and the domain of another body-part in the other language 
include the English idioms which use the concept head to refer to the position 
in the front for which Polish consistently employs the concept forehead (at the 
head of the procession = na czele procesji ‘at the forehead of the procession’; head- 
manager = naczelnik), or the various Polish idioms with the word glowa whose mean¬ 
ing is rendered in English by metaphorical mappings from the body-part domains 
skin and back ( nadstawiac glowg ‘to stick out one’s head’ = to risk one’s skin; glowa 
do gory ‘head up’ = tails up). 

(4) d. DAMAGE TO THE HEAD IS DAMAGE TO THE ACTIVITY OF THINKING 

Czys ty upadl na glowfl 

‘Have you fallen on your headV 

Are you crazy? Can’t you think straight? 
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Mam dzis urwanie gtowy. 

‘My head is being torn off today.’ 

I am extremely busy today. 

Glowa mi pfka. 

‘My head is cracking’. 

I have a splitting headache; my thinking is impaired. 

To marzenie sciftej glowy. 

‘This is the dream of a head that has been cut off’. 

This is utter impossibility. 

The uses in (4)d provide indirect support for the specific metaphorical mapping 

+HEAD IS THE MEDIUM (AND PLACE) FOR/OF THE ACTIVITY OF THINKING+ by Sug¬ 
gesting that damage to the medium of mental activity implies damage to the activity 
itself. When a Polish speaker wants to say that somebody’s thinking is impaired 
or that some idea is utterly unreal, he evokes an image of a head that is cracking 
(glowa peka), that has been fallen on ( upadl na glowe), that is being torn off ( urwanie 
glowy) or even cut off ( marzenie sciqtej gtowy). The examples in (4)e, on the other 
hand, imply that the head is perceived not only as a thinking medium but as a tool, 
a mechanical device that will trigger the activity of thinking when stimulated: by 
knocking ( postukaj sif w glow f), turning (pokrfcic glowq), moving (rusz glowq), even 
breaking ( lamac sobie glowf). The use of the instrumental case on the word glowa is 
significant for the metaphorical mapping +head is an instrument for thinking+. 
As shown by the translation equivalents of the Polish examples in (4)e such as ‘Don’t 
rack your brains about if (Me lam sobie tym glowy), English exploits the domain of 
the body-part brain to render the notion of a thinking tool via metaphor. 

(4) e. HEAD IS A MEDIUM (SIMPLE INSTRUMENT) FOR THINKING 

Muszf dobrze pokrfcic glowq, zeby zarobic te pieniqdze. 

‘To make a profit I have to turn my head real hard’. 

I have to think hard. 

Rusz glowq. 

'Set your head in motion’. 

Start thinking. 

Postukaj sif w glowf. 

‘Knock yourself in the head’. 

Use your senses; be reasonable. 
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(4) e. Nie lam sobie tym glowy. 

‘Don’t break your head over it.’ 

Don’t rack your brains about it. 

(cf. a cross-word puzzle = lamiglowka ’ what breaks your head’; French 
casse-tete) 

3. conclusion. Lexicalized metaphors from the domain body-part head to the 
domain plants are to be found in both Polish and English, and in both languages they 
are based on the mapping of two structural characteristics of the source category: the 
head’s round shape and its position ‘up’ with respect to the rest of the body. 

The anthropocentric character of the source-domain body-part head exploited 
in the metaphorical rendering of our understanding of the target-domain mind 
implies that the experience on which the +head is mind+ metaphor is based is very 
similar across languages. According to the prediction made by Lakoff and Johnson 
(1980) that common experiential motivation should result in the possible universality 
of the metaphoric mechanisms, we should expect far-reaching similarities in head- 
related metaphorical expressions in Polish and English following from the similarity 
in the specific cognitive structures that can be mapped. Sample analysis of the Polish 
expressions with the word glowa carried out here shows that the universality claim 
has to be taken with caution, even in the case of the relatively simple (non-cultural) 
metaphor which exploits the concept of the human head to talk about matters of the 
human mind. At least one cognitive structure +head is the bearer of one’s pre¬ 
occupations-!- in the source category has been found to be present and exploited 
in Polish, but not in English. The reason for this divergence does not lie in the cul¬ 
tural differences, such as those discussed by Cienki (1999) or by Pamies and Inesta 
(in print) in what they call ‘cultural metaphors’, but in the differences in the concep¬ 
tualization of the source concept by the speakers of the two languages. I think that 
the differences in the way speakers of different languages exploit cognitive structures 
inherent in identical source-categories (such as body-parts) are due not only to cul¬ 
tural factors affecting their conceptualization of these categories but are also caused 
by the specificity of the grammatical system they have at their disposal when using 
conceptual metaphor. More research in prepositional and case semantics is needed 
to substantiate the claim that the grammatical structure of a language (in this case, 
Polish) can have a bearing on metaphoric realizations in that language. 


I would like to thank two anonymous reviewers of this paper for their helpful comments 
and suggestions. 

The size factor has been pointed out to me by Bill Sullivan as a source for a possible gener¬ 
alization. 

In the Polish expression glowka czosnku is head of garlic (a garlic bulb), which obviously 
violates the position ‘up’ generalization, the shape characteristic appears so important that 
it overrides all other structures of the source-domain in the metaphor. In the discussion 
following the presentation of this paper, Stephen Straight has suggested to me that the 
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decomponentiality characteristic of a head of garlic, which is also present in a head of 
lettuce or cabbage and in the heads of sunflower, daisy or clover (all of which contain 
elements easily separable into smaller parts: gloves, leaves or petals) may be responsible 
for the presence of the same word head in their names. If the decomponentiality charac¬ 
teristic is taken as the structure responsible for metaphorical transfer from the domain 
body-part head, we would have to assume that the human head is conceptualized as a 
composite of its parts: the nose, the eyes, the mouth, etc., or at least that it contains an 
element easily separable into more or less identical smaller units (the hair?). 

Additional evidence is furnished by examples such as missile head ‘glowica [augmentative] 
pocisku’ or pin head ‘glowka [diminutive] szpilki’, etc. 

The English category wits is rendered in Polish by the uncountable noun rozum, which 
can also signify brains. In other idioms (e.g. wlewac komus olej/ rozum do glowy ‘pour oil/ 
brains into somebody’s head’), rozum is conceptualized as a light-producing liquid. 

The English expression to have something on one’s mind, with the preposition on, applies 
to persistent thoughts about something which can be either negative or positive, unlike 
the expression to have something in one’s hair, with the preposition in, whose referent can 
only be negative. The idiom to have something on one’s mind (as in: You are always on 
my mind) is rendered in Polish by expressions belonging to the +head is a container 
of thoughts-! subcategory, with the prepositions w ‘in’ or po ‘all over, about’. Examples: 
Ciqgle siedzi mi to w glowie ‘It [that thought] is still sitting in my head’, with the verb 
siedziec ‘to sit’ indicating persistence, and the locative preposition w ‘in’ indicating location 
within, or: Ciqgle mi to (ta mysl) chodzi po glowie ‘It (that thought) is constantly walking 
about in my head’, with the imperfective verb chodzic ‘walk’ and the locative preposition 
po ‘all over’ referring to the area covered by the subject’s movement. 
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DATA AND SCIENCE IN INTRODUCTORY LINGUISTICS TEXTBOOKS 


Douglas W. Coleman 
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Previous studies (e.g., Botha 1970; Itkonen 1978,1981; Yngve 1986,1996; Coleman 
1999, 2000) have demonstrated that the usual (i.e., introspective) ‘data of theoretical 
linguistics and the ‘data of science are quite different. 

That, and only that, which happens or obtains in space and time can be 
observed... [IJntuition, for instance, is not a form of observation. To call all ways 
of gaining knowledge by the same name, i.e.,‘observation,’.. .conceals important 
methodological differences. (Itkonen 1978:3) 

In addition, authors in theoretical linguistics typically conflate ‘examples’ (abstract 
types or classes of events) with ‘data ( tokens , or concrete events). These examples 
consist of cited forms which ‘are typically not tokens corresponding to individual 
concrete observable “real-world” events’ but represent an ‘entire class of observation 
tokens’ (Coleman 2000:77). 

Is this state of affairs merely a historical ‘accident’ owing to an ongoing influence 
of the philosophy of language on the field of linguistics, unconsciously transmitted by 
experienced linguists to each generation of those new to the field? If so, then trans¬ 
mission will occur merely by new linguists’ observation of the methodological prac¬ 
tices of those who come before. If not , then we should find explicit instruction in the 
textbooks that serve to provide foundational training to new linguists. 

I have investigated occurrences of certain key concepts in a broad range of intro¬ 
ductory textbooks in linguistics, in the process looking at some of the same concepts 
as those subsumed under lexical analyses in my earlier research on theoretical and 
applied linguistics corpora drawn from professional journals (Coleman 1997,1999, 
2000). I focus on three major aspects in the texts: (1) what they have to say about 
the status of linguistics as a science, (2) what constitute(s) the object(s) of study in 
linguistics, and (3) what constitute the data to be used by linguistics in its examination 
of this object. 

1. do introductory texts say linguistics is a science? Saussure very clearly 
identifies linguistics as a science when he says, ‘Linguistics is closely related to other 
sciences that sometimes borrow from its data, sometimes supply it with data (1959 
[i9i5]:6). Shortly thereafter, he makes the point even more directly: ‘In setting up 
the science of language within the overall study of speech, I have outlined the 
whole of linguistics’ (ibid:i7). Bloomfield (1933) refers to linguistics as ‘the science of 
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language’ (vii) and near the opening of his first chapter says,‘It is only within the last 
century or so that language has been studied in a scientific way...’ (3). Nida makes 
his stand clear when he asserts that ‘practical results should be based upon scientific 
analysis’ (1976 [1949] :i, fn. 3). Whatmough refers to ‘the new science of general lin¬ 
guistics’ (1956:13), and in his glossary defines linguistics as ‘the scientific study of 
the phenomena of language’ (ibid:233); he attributes linguistics’closer relation to the 
exact sciences’ to the ‘application of symbolic logic and mathematics to linguistic 
problems’ (ibid:i3). Whorf (1956) describes linguistic phenomena as ‘the province of 
the grammarian—or of the linguist, to give him his more modern name as a scien¬ 
tist’ (211). He says other things that make it clear he views linguistics as a science, 
as when he refers to ‘the science of linguistics’ (e.g., on page 249). Whorf even com¬ 
pares linguistics to physics at one point (223). Crystal refers to linguistics as ‘the sci¬ 
ence of language’ (1987:412) and equates the terms ‘linguistics’ and ‘linguistic science’ 
(ibid:405). Akmajian et al. (1990:5) call linguistics ‘the scientific study of human nat¬ 
ural language’. In Gee’s (i993:vii-viii) preface an extended simile of science, play, 
and linguistics is offered. Much later, on page 186, he makes the comparison explicit 
when he writes,‘linguists must, like all other scientists...’ Clark et al. call linguistics 
‘the scientific study of language’ (1994:62) and linguists ‘the scientists of language’ 
(ibid:i5). Finegan calls it‘the scientific inquiry into human language’ (1994:13). Macau¬ 
lay (1994:3) calls linguistics ‘the science of language’, and O’Grady et al. refer to their 
topic as ‘the realm of scientific linguistic analysis’ (i997:xxiii). Fromkin and Rodman 
(1998:26) refer to‘the science of linguistics’. Radford et al. argue (1999:2) that the lack 
of‘controlled experiments on large numbers of subjects under laboratory conditions’ 
does not make linguistics ‘unscientific’; they also claim, for example, that the theory 
of UG follows ‘standard scientific practice’ (ibid:7). Weisler and Milekic (2000:130) 
ask, ‘ [C] an you think of any other science in which subjects provide researchers with 
data based on intuitions?’ (Indeed, we cannot, unless we include parapsychology and 
paraphysics, for example, among the sciences!) 

Some writers are less clear on the issue. Sapir (1949 [1921]), for example, does 
not explicitly state his view one way or the other. He uses some of the lexicon of sci¬ 
ence when he refers to ‘linguistics research’ (61), the need for clear evidence’ (103), 
or declares a classification scheme to be experimental’ (140)—but such references are 
sporadic and few. He contrasts a linguistic analysis with a ‘philosophical analysis of 
experience’, but does so on the basis that it is ‘on a sliding scale’ (107), not on the basis 
of a philosophy-vs.-science dichotomy. Bonvillain defines linguistics merely as ‘the 
study of interrelationships between language and other aspects of culture...’ (1993:393). 
Jannedy et al. (1994) define a linguist noncommittally as ‘someone who studies the 
structure of language and its use’ (457). But interestingly, while the authors describe 
phonetics as ‘an experimental science’ (33), phonology, morphology, and syntax‘study’ 
or are ‘the study of’ certain aspects of language (87, 129, 165); semantics ‘addresses 
and attempts to answer’ certain questions (211); and pragmatics ‘concerns itself’ with 
other things (223). Napoli refers to linguistics as a field that is ‘technical’ (1996W) 
and ‘rich, formal, exciting’ (ibid:vii). She states that linguistics is defined not so much 
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by its material as ‘the way [linguists] look at data, the manner in which they argue, 
and the range of conclusions they find sensible in light of the data (ibid). However, 
nowhere—so far as I can tell—does she explicitly identify linguistics as a science, even 
though this is strongly suggested by the preceding quotation. Hudson (2000) nowhere 
explicitly identifies linguistics as a ‘science’, but does contrast the approach of those 
working within ‘generative grammar’ with that of‘empiricist linguists’ (499), the latter 
of which maybe understood to work within the domain of science. Similarly, although 
Matthews cites a common definition found in general dictionaries—‘the science of 
language’ (i997:vii), he also immediately thereafter cites what he terms a ‘more cautious 
wording’—‘the branch of knowledge that deals with language’. As I will show, almost 
all of those who are in this fashion unclear or noncommittal on the scientific status of 
linguistics use the lexicon of science in referring to ‘hypotheses’,‘data, and so on. 

In my entire informal sample, the authors of only one introductory text openly 
express doubts about the scientific status of our field. Southworth and Daswani 
(1974:8-9) say the drive ‘to create a more empirical base for their field’ 

.. .has not, however, been strong enough to prevent bias in data selection and 
presentation, or in the selection of problems for study, because of assumptions 
about the nature of language which are based on philosophical positions, rather 
than on the linguistic data alone. 

2. WHAT DO THE TEXTBOOKS SAY IS THE OBJECT OF STUDY OF LINGUISTICS? Saussure 
(1959 [1915]) identifies the‘scope of linguistics’ as‘(a) to describe and trace the history 
of all observable languages..., (b) to determine the forces... at work in all languag¬ 
es..., and (c) to define and delimit itself’ (6). With the exception of (c), the object of 
study of linguistics is clearly identified as language. Shortly after this, under the head¬ 
ing ‘Definition of Language’ (7), his key question is ‘What is both the integral and con¬ 
crete object of linguistics?’ Is Saussure unaware of the problem regarding language 
as a ‘concrete object?’ No. In his discussion of the neogrammarians (5, fn. 2), he says, 
‘language is not an entity and exists only within speakers’ and later, in this oft-quoted 
excerpt (8): 

Other sciences work with objects that are given in advance... but not linguistics. 
... [In linguistics, f] ar from it being the object that creates the viewpoint, it is the 
viewpoint that creates the object. 

He resolves this as follows: ‘One must not go too far, however, and a compromise is in 
order’ (5) and ‘...from the very outset we must put both feet on the ground of language 
and use language as the norm of all other manifestations of speech’ (9-emphasis his). 

Others have followed his lead. Bloomfield (1933:27) calls language ‘the subject 
of our study’, clarifying the scope of linguistics by distinguishing ‘the act of speech 
and the other occurrences... practical events’ (23). In his terms, the latter are com¬ 
posed of ‘stimuli and reactions’ and are ‘real... events’ (27). But to him, ‘as students of 
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language... we are concerned precisely with the speech event’ (26-27). Nida defines 
morphology, for example, as ‘the study of morphemes and their arrangements into 
words’ (1976 [1949]). Whatmough (1956) says, ‘This book is about language’ (18) and 
identifies the objects of study in linguistics as ‘the phenomena of language’ (233). 
Whorf (1956) refers at various points to the examination or analysis of a language or 
of languages (e.g., on page 212). Southworth and Daswani (1974:6) explicitly identify 
the object of study of linguistics as ‘the language itself (emphasis theirs). Crystal iden¬ 
tifies language more obliquely as an ‘object of fascination’ but does associate its study 
with linguistics (1987:404). Akmajian et al. (1990:5) identify the object of study of lin¬ 
guistics as ‘human natural language’ and‘the nature of language and communication’. 

Gee (1993) focuses on ‘the study of language’ (ix) and identifies ‘a large part of what 
linguists study’ as ‘the words and syntax of particular languages and of language in 
general (language universals)’ (13). He broadens his definition of the scope of lin¬ 
guistics when he says, ‘linguistics also involves moving beyond words and the syntax 
of phrases and sentences to study how language is used in real contexts’ (the empha¬ 
sis on ‘real’ is mine) (13). He seems to see this as a positive thing, despite the impli¬ 
cation that most of linguistics deals with unreal contexts. Gee here also repeats the 
solipsism that language is an object (like a paintbrush or a pie fork) that can be used. 
Those who would defend a reference to language as a ‘usable object’, calling it merely 
a metaphorical convention, should note what he says about the reality of the objects 
of language: 

I will call entities like cells and genes, or molecules and atoms theoretical entities. 

I call them this not because they don’t exist (they do exist), but because they are 
used to explain the properties of things that are more readily apparent and easier 
to observe than they are (things like cats and water, and the sentences one hears 
and understands). (186) 

In other words, Gee places ‘sentences’ on the same level of reality as cats and water! 

In Gee’s following paragraph (186) we see a close replay of Saussure’s conceptual 
wrestling match with the concreteness of the objects of language, and Gee comes to 
basically the same conclusion as Saussure: sentences, like cats and water, are concrete. 

Jannedy et al. (i994:ix) identify language as ‘the object of our study’; Clark et 
al. (1994) describe their book as ‘an introduction to the study of language’ (2) and 
define linguistics as‘the study... of language’ (62); and Finegan (1994:14) describes 
the object of study of linguistics as ‘particular facets of languages’ and ‘the nature of 
language itself’. Macaulay (1994:3) identifies the object of study of linguistics first as 
‘the abstract nature of linguistic structure’ and—like Gee (1993)—goes on to explain 
that more recently ‘linguists have begun to look at ways in which people use lan¬ 
guage’ (emphasis mine). Napoli (i996:v) identifies the subject of her book as ‘the core 
areas of the analysis of language’. O’Grady et al. (1997:1) say that linguistics studies 
‘the nature and use of language’ (emphasis mine). Matthews says linguistics is the‘sci¬ 
ence’ or ‘study’ of language and that it ‘deals with language’ (i997:viii). Fromkin and 
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Rodman (1998:14) say linguists ‘describe a language’ and equate ‘a true model of the 
speakers’ linguistic capability’ with ‘a successful description of the grammar and of the 
language itself’. Radford et al. identify‘a major task for the linguist’ as that of discover¬ 
ing ‘the nature of ...grammars’ (which, of course, they have already said consist ‘at 
least’ of‘a lexicon, a syntactic component, a PF [phonetic form] component and an LF 
[logical form] component’ (1999:6). Weisler and Milekic (2000:1) refer to ‘the study of 
language’ (their book is Theory of Language). Hudson identifies concerns of linguists 
as including ‘how languages are structured and work, and how they are learned’ as 
well as the application of linguistics in practical areas (2000:499). 

Sapir (1949 [1921]) does not make an explicit pronouncement on the issue, but 
makes it clear, nevertheless, that the object of study of linguistics is language—e.g., 
‘the way is now cleared for a serviceable definition of language’ (8) and in chapter 
titles such as ‘The Sounds of Language’, ‘The Form of Language’, etc. 

Bonvillain (1993) is the only one of the authors from my informal sample of 22 
books who disagrees. And she does not define linguistics, per se, as her topic, but lays 
out her subject matter as ‘an ethnography of communication. She identifies its com¬ 
ponents (85) in a quotation from Dell Hymes (1974:9), in which these include not only 
language (the ‘code’) but also the participants, the channel, the setting, the form or 
genre, and topics and attitudes of the participants. 

In all of the other texts, there is no question that the object of study is language. 
Now, suppose we allow that ‘language’ as used by all these authors might simply refer 
to a set of properties, states, processes, what have you, in an individual. This is what 
Radford et al. (1999:1), for example, claim: 

[W]hen we assert that Tom is a speaker of English, we are ascribing to Tom a 
certain mental structure. This structure is somehow represented in Tom’s brain, so 
we are also implicitly saying that Tom’s brain is in a certain state. 

This if fine, as long as we remember that we cannot observe ‘mental structure’. We can 
posit the existence of phonemes, morphemes, sentences, etc. as what Gee (1993:186) 
calls ‘theoretical entities’ corresponding to ‘mental structures’, only if by that we mean 
we are positing theoretical entities correlated somehow with physical properties of a 
person. But we cannot claim to ‘observe’ mental structures. Sentences, for example, do 
not exist in the same way as cats and water. 

Why is it a problem to accept language as the object of study of linguistics? Is it 
because elements of language are taken as theoretical entities? No. But we can see 
that taking elements of language as observable objects is quite unreasonable. In short, 
seeing language as a theory to explain observable human behavior may work (an 
empirical issue). Seeing language as a ‘phenomenon to investigate cannot possibly 
work (an ontological issue). 
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ABSOLUTE FORM 

SANDHI-FORM 

['bo:] cow’ 

[an Vo:] ‘the cow’ 

[ar 'mo:] ‘our cow’ 

[uv] ‘egg’ 

[an 'tuv] ‘the egg’ 

[na nuv] ‘of the eggs’ 

[a ‘huv] ‘her egg’ 

['ba:n] ‘white’ 

['bo: 'va:n] ‘white cow’ 

['bog] ‘soft’ 

['ro: 'vog] Very soft’ 

['bris] ‘break’ 

[do 'vris] ‘did break’ 


Figure 1. Examples from Bloomfield (1933:188) 

3. WHAT CONSTITUTES DATA IN THE INTRODUCTORY TEXTBOOKS? This leads US to the 
question of what introductory textbooks in linguistics consider to be the data of our 
field. 

The translator (Baskin) of Saussure (1959 [1915]) treats data and evidence as inter¬ 
changeable terms, apparently referring to the same thing, for example, as external evi¬ 
dence’ at one point and then as ‘external data in the next paragraph (35). In actual 
arguments, Saussure typically cites example forms: ‘French au (e.g., in sauter ‘jump’)’ 
(35); ‘Tatin crispus ‘crisp’ provided French with the root crep- from which were formed 
the verbs crepir ‘rough-cast’ and decrepir ‘remove mortar” (83). In other words, his 
data, or evidence, consists of abstract, generalized types rather than observable tokens 
(Coleman 2000). 

Bloomfield uses the term data repeatedly when writing in general terms, e.g., ‘All 
historical study of language is based upon the comparison of two or more sets of 
descriptive data (19). When presenting specific arguments, however, he uses terms 
like case (159), form, and example (160) interchangeably to refer to the material upon 
which he is basing his arguments. Figure 1 shows some of Bloomfield’s examples 
(188) from Modern Irish. Although at least one of these terms (form ) would seem 
to exclude the gloss, it is clear that Bloomfield’s arguments work only if semantic 
relationships—e.g., that between ['bo:] ‘cow’ and [an Vo:] ‘the cow’ can be assumed. 
Thus, Bloomfield, like Saussure before him, is working with pre-analyzed types, rather 
than observation-tokens. 

Since Bloomfield seems to use case, form, and example interchangeably whenever 
he directly refers to the material that constitutes his data, let’s look at three instances 
in which he uses the term form. He (1933:213) says,‘[T]he speaker who says knives... 
merely utters a form (knives )...’ Here, it appears that by form, Bloomfield refers to a 
communicative event (albeit hypothetical), which (if actual) would involve at least 
some things available for observation in a scientific sense. Eater on the same page, 
he says a description should be arranged ‘so that as many forms as possible will be 
included’. This time context makes it clear he is referring to a form as a type, not an 
observation-token. Still, it is a type representing a hypothesized class of observation- 
tokens. Elsewhere on the same page, Bloomfield refers to an underlying form in much 
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the same sense in which it has been used more recently: ‘an indefinite number of 
plural nouns retains the [-s] of the underlying form. This use oiform clearly refers to 
a type which exists only as a theoretical entity, and which therefore cannot be directly 
observed, so it has no direct relationship to observation-tokens. Such confusions of 
types and tokens are pervasive throughout Bloomfield’s Language, just as they are 
through later textbooks (just as they are through journal articles in theoretical lin¬ 
guistics [TL]—Coleman 2000). 

Sapir’s (1949 [1921]) usage is very similar to Bloomfield’s. In the body of his argu¬ 
ments, he generally refers to his material as forms, examples, or even as types. When 
speaking about method, however, as in footnote 7 on p. 212, he at least sometimes uses 
the term data: ‘By working back from such data as we may possess...’ In all cases, his 
forms/examples/types/data are, however, types, not actual observation-tokens. 

Much of Nida’s (1976 [1949]) data may have been obtained from interaction with 
informants. But like the data of Coleman’s (2000) TL corpus, some is from secondary 
sources, and it is not made clear which is from naive informants, which from descrip¬ 
tive grammars, which from scholarly articles — and therefore which may actually be 
invented material. This is not an empty charge based on an assumption of guilt until 
proof of innocence. Consider Nida’s use of the term data in the following instructions 
to an exercise on some homophonous forms in English (58): 

If the forms are related in meaning, determine whether the forms are in comple¬ 
mentary distribution. (Supplementary data must be supplied by the student.) 

Here, Nida invites the student to invent his/her own data. 

Whatmough (1956) refers at least once to a sample (184) in a way that is somewhat 
consistent with usage in the social sciences and with that of qualitative analyses in cur¬ 
rent corpus linguistics. Generally, however, he cites types, but without referring to them 
as data, forms, cases, etc., e.g.,‘and likewise in French deux cents and cens deux’ (136). 

Whorf (1956; e.g., 45, 52,124,168) most often refers to his material as examples, 
samples, or forms —in almost all cases, they are types, not observation-tokens. How¬ 
ever, there are instances in which he makes reference to material actually observed, 
e.g., in his discussion of a Mayan inscription in the Temple of the Tepoztecatl, of 
which he provides an annotated sketch (44). In discussing the inscription, he alter¬ 
nately refers to its elements with the terms form, sign, hieroglyph, hieroglyphic element, 
grapheme, figure, and glyph. While some of these (e.g., form) may arguably refer to 
observable visual data in a scientific sense, others refer unambiguously not to observ¬ 
able objects, but to inferred types (e.g ., grapheme). Yet they seem to interchangeably 
refer to the same elements in the illustration. Is this just a careless use of terminology, 
or is it indicative of a genuine confusion? Consider what Whorf (223) says elsewhere: 

Physics does not begin with atomic structures and cosmic rays, but with motions 
of ordinary gross physical objects and symbolic... expressions for these move¬ 
ments. Linguistics likewise [see below] does not begin with meaning nor with the 
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structure of logical propositions, but with the obligatory patterns made by the 
gross audible sounds of a given language and with certain symbolic expressions 
of its own for these patterns. 

But then try to put this together with the following (248): 

First, the plane ‘below’ the strictly linguistic phenomena is a physical, acoustic 
one, phenomena wrought of sound waves. 

If‘strictly linguistic phenomena are not physical phenomena, then the likewise in the 
earlier quotation is cancelled out. We see a genuine confusion of physical and non¬ 
physical (objective and subjective) domains here. 

This confusion of tokens (data) and types ( examples ) is pervasive throughout 
almost all the textbooks, mirroring that observed in the TL corpus of Coleman (1999, 
2000). Southworth and Daswani (1974:149-50) call some of the same things examples 
at one point and data at another (e.g., 150,170). Akmajian et al. (1990) also do this 
(e.g., 119) and at other points use the terms interchangeably (e.g., 186,188). In develop¬ 
ing one analysis. Gee (1993) calls his items (2i)a-(23)b examples and his (25)a-(27)b 
data. Jannedy et al. (1994) typically call the material of their exercises data , but occa¬ 
sionally say it consists of forms (e.g., 155) or examples (e.g., 207). Some authors in Clark 
et al. (1994) seem to prefer the term data over example, or vice versa, and the confusion 
arises overtly here and there (e.g., 323-24, where the same items are called examples, 
then data). Finegan (1994) calls material for analysis in parallel exercises examples, 
forms, or data (e.g., 72-73). Macaulay (1994) generally avoids the term data, preferring 
to call his material examples (and occasionally/orms); the same confusion of types 
and tokens is evident, however, when we compare references to cited forms (types) 
as examples (e.g., 154,158) and elsewhere, actual material from recorded interviews 
(tokens) as examples (e.g., 119-20). Napoli (1996) refers to the same material as data 
and examples (e.g., 129). O’Grady et al. (1997) call two sets of parallel items in the 
same analysis data and examples, respectively (69). In Fromkin and Rodman (1998), 
the same kind of material is referred to as examples in one place and as data in another 
(cf. the Russian examples on 72 and the Paku data on 96). In Radford et al. (1999:2), 
a discussion of data in linguistics includes the use of the following, referred to as 
examples: The dog chased the cat and *Cat the dog chased the. In Weisler and Milekic 
(2000:130-31), the same material is called some examples and then the previous data 
set. Hudson (2000) calls material for analysis in exercises data, but alternates between 
the terms data and example when presenting analyses himself (cf. an analysis of 
Amharic verbs 65 ff. and one of Spanish verbs 77 ff.). 

I could not find a single reference to data in Matthews (1997); he seems to refer 
exclusively to forms or examples, and in fact all his forms and examples are types, not 
tokens. He does avoid the confusion, by avoiding mention of any data. 

Most of the texts briefly mention the availability of forms of data other than 
that invented by the researcher, but provide only lip service in this regard. Crystal 
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(i987:esp. 410 ff.) is one who presents a broader view of data sources in linguistics, 
including not only information provided by informants, but experiments, corpora, 
etc. Another is Bonvillain (1993), who very consistently refers to linguistic types as 
examples and observation-tokens (properties of people, the setting, etc.) as data, 
except in her introduction (4), where she refers to all types of evidence as data. Except 
for the one case in the introduction, her usage parallels that of applied linguistics, as 
found in Coleman (1999,2000). 

4. concluding remarks. Linguistics calls itself a science. Linguists clearly aspire to 
be scientific. 

The object of study in any natural science is—to use Saussure’s terms—‘given in 
advance’. It is not created by the viewpoint of the observer. And this is even true of 
the social sciences, such as applied linguistics, in which observations are typically 
of people—their properties, and their actions, as shown by the corpus-based 
research of Coleman (1999, 2000). Theoretical linguistics can accurately call itself a 
‘science’ only if it starts framing its theories in terms of real-world (‘given-in-advance’) 
objects such as those of applied linguistics or those mentioned by ethnolinguists such 
as Hymes (1974) or—within the field of general linguistics—Yngve (1986,1996). 

I have demonstrated the pervasive confusions in introductory linguistic textbooks 
with regard to what constitutes linguistic data—confusions between observation- 
tokens and inferred types. This is a specific instance of what Yngve (1996:24,33) calls 
domain confusions, confusions between the physical and logical domains. We are 
teaching our students to be confused. Why? The linguistic community as a whole 
seems to be confused on this issue. 

We need to rely on predictive power as our primary criterion for acceptance of 
a theory in linguistics. Now, is this merely a call for us to seek what has been for 
years called ‘descriptive adequacy?’ No it is not. The predictive power of a model 
can be measured only in terms of its ‘fit’ with data. The pervasive confusion in linguis¬ 
tics over whether scientific data can consist of observation-tokens (which it can) or 
inferred generalized types (which it cannot) prevents us from being able to measure 
the predictive power of competing models. 

In short, what we need to do is to begin by scrapping all notions of‘data as some¬ 
thing the researcher can invent, for that matter, as anything existing in the non-phys¬ 
ical domain. The fact that we lack physical theories of cause-effect on which to base 
our claims may excuse us for positing theoretical entities like phonemes, morphemes, 
and so on, entities which may have physical correlates in the brain. But it does not 
justify our using as ‘data’ things that we know very well we are not ‘observing’ in any 
scientific sense. 
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VERBS OF TOLERANCE WITH THE GERUND AND THE INFINITIVE 


Patrick J. Duffley 
Universite Laval 


While tolerance has achieved recognition as one of the cardinal virtues of 
political correctness, it has not yet been the subject of much discussion among Eng¬ 
lish grammarians. The verbs which express this notion in English— tolerate, bear, 
stand, abide and endure 1 —do however present a number of points which are of inter¬ 
est for the grammarian. Most of these verbs can be construed either with the infinitive 
or the gerund. Curiously however, as noted by Visser (1973:1877), ‘a semantic differ¬ 
ence between the two alternatives—one with the infinitive, the other with the form 
in -ing —is not discernible’. This is not always the case with verbs which admit these 
two complement forms, as there is often a clear semantic distinction between the two 
constructions; cf. the examples in (i)-(3): 

(1) a. I tried to open the window. b. I tried opening the window. 

(2) a. I remembered to call her. b. I remembered calling her. 

(3) a. I regret to tell you about this. b. I regret telling you about this. 

The apparent absence of a clear semantic difference between the gerundive and infin¬ 
itival constructions with verbs expressing the notion of tolerance is thus an aspect of 
usage which needs to be explored. Are there any cases where a distinction is perceiv¬ 
able? Can any explanation be offered for the closeness in meaning between the two 
constructions? 

Another point of interest is the absence of any attested use of the to-infinitive after 
the verb tolerate itself. Infinitival constructions are attested with the verbs bear (4), 
stand (5), abide (6) and endure (7): 

(4) .. .he thrust one hand into his trousers pocket and pulled out the six slugs 
he had taken from the revolver. (...) He put the shells on the table, as 
though he could no longer bear to hold them, (buc li8 0770) 

(5) I cannot stand to stay here any longer. (Visser 1969:1326) 

(6) You know I cannot abide to light my Woodbine from the campfire. 

(bnc hwn 1538) 

(7) But Carteret’s wife, who frequented health spas, could not endure to live 
with him or he with her: there were no children, (bnc CD2 1061) 

In no case to my knowledge however is the verb tolerate construed with the infinitive, 
a fact for which an explanation will be offered in this paper. 
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A third curiosity concerning the verbs of tolerance also has to do with the infini¬ 
tival construction. All of the cases of this construction which have been attested occur 
in nonassertive contexts (cf. Quirk et al. 1985: 83-85,784-85) where the verb of tolera¬ 
tion is preceded by the modal auxiliary can with the latter being either negated or 
questioned, as illustrated in (8) and (9): 

(8) ... Stuart had such a happy, earnest look of proud possession on his face 
that Maggie couldn’t bear to do anything to quench it. (buc poi 1690) 

(9) Beats me how she can bear to enter their hovels! (lob ph 43) 

Included under the general notion of nonassertiveness are contexts containing semi¬ 
negatives (10), restrictives (11) and negative polarity items (12): 

(10) I can scarcely bear to leave you behind even for a few weeks, (lob P13 88) 

(11) He could only bear to have a doctor who did not take the absurd business 
too seriously, who realized that the whole sad joke of men living for 
only seventy years was made worse if you treated it as anything else. 

(lob ko6 54) 

(12) He found himself more and more surprised that any person could bear to 
be hard to another, (lob ko6 132) 

In contrast to the infinitive, the gerundive construction does occur in assertive con¬ 
structions, as in: 

(13) As regards simple absence of humour, I like to think I’m as fond of a good 
laugh as the next man, but I can stand doing without for long periods 
when reading, having been trained in the Oxford English school... 

(lob G36 131) 

(14) Here Mrs. Lowyer endured being alternately frozen and scalded by a terri¬ 
fied junior, then her hair wound around rollers and her scalp speared with 
plastic pins, and the torture was finally rounded off by being put under a 
red-hot dryer which she was unable to make cooler, however much she 
turned the knob on the dial, (bnc FA5 2231) 

(15) Such fibrous-rooted subjects as bedding plants, lettuces and most herba¬ 
ceous perennials will tolerate being transplanted, (bnc acx 1586) 

Other aspects of usage could be mentioned, but those referred to above will present a 
sufficient challenge for our explanatory hypothesis. 

This hypothesis holds that the explanation of complement usage of the gerund and 
the infinitive requires taking into account the following three parameters: 

a. the semantics of the verbal complement itself, i.e. the -ing form or the 
to-infinitive 
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b. the function of the verbal complement with respect to the main verb 

c. the lexical meaning of the main verb. 

Regarding the semantics of the verbal complement, the -ing form will be assumed 
to have its ‘gerundive’ value in usage with the verbs of toleration 2 . In this actualiza¬ 
tion of the -ing, the aspectual value of imperfectivity associated with the -ing in the 
progressive construction is not present. This can be seen from the compatibility of 
the gerundive -ing with the preposition after as compared to the impossibility of the 
progressive: 

(16) a. * After he was eating his hamburger, he went home, 
b. After eating his hamburger, he went home. 

The gerundive -ing evokes an event as a whole much like a noun does (cf. Duffley 
1995:8-10); compare (i6)b above with (i6)c below: 

(16) c. After the meal, he went home. 

The -ing’s function with the verbs of toleration is also like that of a noun: it simply 
performs the role of direct object with these verbs. This can be seen from the possibil¬ 
ity of replacing it by a nominal pronoun: 

(17) a. I can’t stand seeing you lick the boots of a man like Willi. (Visser 1973:1877) 
b. I can’t stand it. 

and the possibility of passive constructions, which show that the -ing corresponds 
semantically to that which is ‘X ed’: 

(18) a. Some people can stand living alone for long periods of time. 

b. Living alone can be stood by some people for long periods of time. 

In contrast, the fo-infinitive does not allow passive constructions (19), nor sub¬ 
stitution by nominal pronouns, the anaphoric form corresponding to the infinitive 
being a stranded to (20): 

(19) a. Some people can stand to live alone for long periods of time. 

b. ?*To live alone can be stood by some people for long periods of time. 

(20) a. I cannot stand to stay here any longer. 

b. ... and they can’t stand to either. 

c. *... and they can’t stand it either. 

It will be held here that the infinitive is a prepositional phrase whose relation to the 
main verb is defined by the preposition to. The latter represents the infinitive’s event 
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as the endpoint of a movement leading from the point in time corresponding to the 
main verb’s event up to that of the infinitive (cf. Duffley 1992:16-18). According to 
whether this movement is understood to be realized or not, the infinitive will be 
evoked as the goal or result of the main verb’s event. In either case, it is represented as 
subsequent to the latter. 

This leaves the third and crucial factor—the lexical meaning of the main verb. Two 
of the five verbs under study involve the notion of holding up under some sort of 
strain or pressure without yielding: bear and stand. Bear is based on the metaphor 
of supporting a weight and is defined by Webster’s (1966) as ‘sustaining an affliction, 
onerous or difficult, without breaking or flinching’; stand is based on the metaphor of 
maintaining an upright position in the face of attack or aggression and is defined as 
‘to resist without yielding or retreating’. Two other verbs involve the idea of making it 
through the duration of a trial without succumbing: endure and abide. The latter also 
has overtones of overcoming impatience or disgust (cf. I cannot abide such people; I 
can’t abide the taste of caraway). Lastly, tolerate is defined as ‘to endure with forbear¬ 
ance and restraint’ and evokes the idea of not doing anything to combat or eliminate 
a source of affliction or disturbance. 

A further grouping may be made of the first four verbs, which contain an ele¬ 
ment of meaning which is not present in the last one. Bear, stand, endure and abide 
all involve an idea of coming through a trial or difficulty successfully: bear and stand 
denote a holding up under strain or attack without giving way or flinching; endure 
and abide evoke making it through the duration of a trying situation without suc¬ 
cumbing to weariness, impatience or disgust. This element of meaning, it will be 
claimed here, is what explains the construal of these verbs with can and the fo-infin- 
itive. The notion of facing a trial successfully has a natural connection to the idea 
of whether someone has what it takes to hold up without succumbing, i.e. the capac¬ 
ity to bear, stand, endure or abide. From the point of view of the capacity of facing 
a trial, it is also quite natural to conceive the trial itself as something subsequent to 
this capacity. Since the trial or difficulty to be faced is expressed by the infinitive, it is 
understandable that the latter should be represented by means of the preposition to as 
an after-position with respect to the capacity of facing it. Thus a construction such as 
(21) means ‘he did not have the capacity required to say an unkind word to anyone’: 

(21) He could not bear to say an unkind word to anyone, (lob co8 76) 

The fact that all of the uses of the to-infinitive occur in negative or nonassertive con¬ 
texts also lends support to our hypothesis: the notion of a lack of what is required to 
get through the infinitive’s event successfully or a doubt as to whether one has what it 
takes to hold up under it makes the before/affer relation between the main verb and 
the infinitive particularly salient. 

In contrast, the three cases of the gerund construction observed thus far with 
can + negation all evoke existent situations: 
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(17) a. I can’t stand seeing you lick the boots of a man like Willi. (Visser 1973:1877) 

(22) If it hadn’t been her dad’s week for night shift, her mum wouldn’t have 
minded. As it was, Mum couldn’t stand being alone in the house at 
night... (lob G36 131) 

(23) ... John had drifted away from the gang. Mousie said it was because he 
was too proud to stand pity. Others thought he couldn’t stand seeing 
Linda, Mrs. Bobbie Evans, still so beautiful, so much in command of 
everything, (buc P17 0610) 

Here it is a lack of the capacity to endure what actually exists or has existed which is 
evoked. It is noteworthy that the -ing could not be used in a context such as (21) above 
where the complement’s event is understood to be non-realized. 

This is not to say that the to-infinitive cannot be used to refer to existing events. 
Such is the case in: 

(24) Do please tell Pepita. I can’t bear to see her like this, (lob po8 106) 

However the infinitive is associated with the future-oriented impression of‘any longer; 
any more’ in this type of use. This notion is even sometimes explicitly expressed as in 
(4) and (5) above. The infinitive is not possible however in contexts where the refer¬ 
ence is to the actual sustaining of an actualized event. This is the case in (14), where 
the verb endure is not preceded by a modal and occurs in the past tense. Here there 
is no before/after relation between the enduring and the being alternately frozen and 
scalded; the -ing complement evokes simply that which was endured by Mrs. Lowyer. 

Turning to the verb tolerate, we can now offer an explanation for the fact that this 
verb is not construed with the fo-infinitive. Since it does not involve the notion of suc¬ 
cess in coming through a trial or difficulty (but rather the idea of putting up with a 
situation) it does not pose the question of whether the subject has what is required to 
hold up under a trial without succumbing. Consequently the difficult situation is rep¬ 
resented as that which is or cannot be tolerated and not as something which the sub¬ 
ject lacks the capacity to sustain without giving way. This can be illustrated by (25): 

(25) Rose no longer wished to go out, nor could she tolerate being left on her 
own, so she became totally dependent on other people, (bnc aso 1168) 

However since in many cases the inability to tolerate something logically implies the 
lack of the capacity to sustain it without losing one’s patience, a sentence such as (25) 
is very close, as far as the overall message conveyed is concerned, to the construction 
with the fo-infinitive nor could she bear to be left alone. It is the choice of the linguistic 
means for conveying the message—in this case the choice of the lexical notion of‘tol¬ 
erating’ rather than that of‘bearing’—which situates the speaker mentally on the level 
of not putting up with the event itself, rather than on the level of lacking the capacity 
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to perform it without giving way or losing one’s patience, a view which allows a con- 
strual of the absence of the capacity as a before-position with respect to the being left 
alone. 

As can be seen from the remarks above, we are in a very delicate area of English 
grammar where the lexical notions are closely related to one another and difficult to 
tease apart. We hope to have shown nevertheless that English usage is not haphazard 
with verbs of toleration and that the special behaviour of the verb tolerate is due to its 
lexical content being slightly different from the other members of this tiny semantic 
field. It remains to be seen whether our hypothesis will stand the test of further scru¬ 
tiny and whether the insights gleaned from this examination of verbs of toleration 
can be related to complementation constructions with the infinitive and the -mg form 
after other types of verbs. 


The construction put up with is not considered in this study as it is not a simple verb but a 
complex phrase made up of verb + particle + preposition. Its exclusive construal with the 
-ing form seems attributable to the incompatibility between the preposition with and the 
to-infmitive phrase, an incompatibility which is characteristic of ah prepositions. 
According to the traditional view, the -ing is held to be a ‘gerund’ when it has the function 
of a noun in the sentence (cf. Quirk et al. 1985:1292). Such is the case in a use such as Paint¬ 
ing a child is difficult , where the -ing form fulfils the role of subject. For a discussion of the 
semantic relation between the ‘gerundive’ and ‘participial’ uses of the -ing see Duffley 1995. 
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In the last few years I have been exploring the notion of fashion in lexical choice. In 
the process of trying to be alert to vogue expressions, I have been hearing and seeing 
in print the pronoun you not directed to a specific addressee, as in (1) and (2): 

(1) You have the KLA attacking some of the French troops. (Georgie Anne 
Geyer, Washington Week in Review, 20 August 1999) 1 

(2) You did not stay in the Commodore for the quality of its rooms. The only 
thing that came with your room at the Commodore was a 16 per cent 
service charge, and whatever you found in the blue-and-gold shag rugs. 
(Thomas Friedman, From Beirut to Jerusalem, 1989) 

After thirty years of teaching English composition, my immediate reaction is to 
grab my green pen and caution, ‘Use precise vocabulary. You is a second person pro¬ 
noun directed to an addressee. You are not addressing a speaker/reader here. Revise 
this sentence’. 

What really needs to be revised is my own understanding of you. The sheer volume 
of examples I keep encountering in my sporadic efforts at observation attests that in 
ordinary, clear, spoken English and even in careful writing, the pronoun you is simply 
not being confined to addressing a specific audience of one or more. This paper gives 
examples of extended functions of you in current American English and interprets 
them as an effect of the movement towards informality in American society during 
the twentieth century. 

The non-addressing function of the second person pronoun (for convenience 
sometimes called indefinite you ) has been part of English usage for more than four 
centuries. Most reference books about the English language do include indefinite you, 
but usually not in detail and often merely to characterize it as inappropriate in writing 
because of its colloquial or informal flavor. 

Two of the handbooks most widely used in American colleges and universities— 
TheAllyn and Bacon handbook, 3rd ed. (1992) and The Harbrace college handbook, 13th 
ed. (1998)—acknowledge that you can have an indefinite referent, but both caution 
against this use in ‘academic’ or ‘very formal writing’. 

In a cursory check of several popular guides to correct usage in writing, I found 
that most are silent on the subject. Their major concern with the second person pro¬ 
noun is to distinguish the spellings your and you’re. Some of these mention you with 
indefinite reference in an entry on indefinite one, in which one is recommended as the 
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preferred and more formal substitute for you. Bryan Garner’s brand new The Oxford 
dictionary of American usage and style (2000) does not have an entry for either indefi¬ 
nite you or indefinite one but has an entry distinguishing your and you’re that uses 
indefinite you (356): ‘“In boxing you don’t have that kind of luxury or time. If you mess 
up in a fight or two you’re [read your] career could be over”. (Lancaster New Era [quot¬ 
ing an interviewee])’ 

By far the best treatment in a usage manual—as you might guess—is the entry 
for you in Merriam-Webster’s dictionary of English usage (1989). It explains the indefi¬ 
nite you as related to the addressing function of the second person and points out the 
‘informal effect’ using five citations from oral and written sources from 1957 through 
1987, including one from an interview with Nobel Prize winner William Faulkner. The 
Merriam-Webster reference volume also links the development of you to ‘the effect 
social forces have on the language’ (970). 

Dictionary treatments of indefinite you are brief. The Oxford English dictionary, 
2nd ed. (1989: xx, 766) documents its use in seven citations dated from 1577 through 
1870. Webster’s third (1966 [1961]: 2654) defines it with the synonym one and gives one 
citation. More pertinent, however, is the subentry under you for the phrase for you, 
defined as ‘displayed in his, her, its, or their characteristic nature or behavior’, with the 
citation ‘he’s already forgotten he’s hurt—that’s a child for you’. This is clearly a you 
lacking a specific addressee, and it is clearly not an informal variant of one. Merriam- 
Webster’s collegiate dictionary, 10th ed. (1993:1374) simply gives the synonym one with 
no citation or commentary. American Heritage 3rd ed. (1992: 2072) says,‘Used to refer 
to an indefinitely specified person—one: “You can’t win them all” ’. 

Standard reference grammars all mention you with indefinite reference but invari¬ 
ably give more inches to indefinite one. The best treatment of you in a grammar book 
that I have found is in Douglas Biber et al’s Longman grammar of spoken and written 
English (1999), which is based on a corpus of more than forty million words, both 
spoken and written, divided into four core registers: conversation, fiction, news, and 
academic prose. In distinguishing one from indefinite uses of we, you, and they, the 
grammar observes that the personal pronouns ‘tend to retain a tinge of their basic 
meaning’ (331). Furthermore, the treatment of comment clauses in Biber et al gives 
examples of finite clauses with indefinite you from the news and academic prose reg¬ 
isters that show ‘recognition of the audience’s knowledge or expectations’ [See, for 
example, as you might guess three paragraphs above in this piece of academic prose.]. 
‘In news and academic prose, the use of the subject you in these clauses creates a more 
involved text, overtly attributing an attitude to the reader’ (866). I find this an accurate 
insight into the effect of you by Biber et al. 

However, aside from comments here and there, the semantics of you has not been 
the focus of much analysis. In 1978, M. Stanley Whitley pointed out the shortcomings 
of a syntactic feature analysis of we, you, and they and proposed to overcome them by 
a series of constraints. In response, Dwight Bolinger (1979) offered instead an inter¬ 
pretation based on the process of metaphor. Bolinger pictures the extension of you 
as a kind of metaphoric transfer along two continua—specific to generic and per- 
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sonal to impersonal. You allows speakers both to generalize and to personalize. Says 
Bolinger,‘The interpretation is open, and goes the way that best fits the pragmatics of 
the sentence’ (208). Furthermore, he finds you to be a courtesy device, a way of invit¬ 
ing another to share a viewpoint (205)—which seems to be what is going on in the 
comment clauses cited in Biber et al. 

The reigning expert on personal pronouns in English is Katie Wales, author of Per¬ 
sonal pronouns in present-day English (1996). The cutoff date of her book is 1995. She 
collected data from more than twenty grammar books of the twentieth century and 
also searched the corpora of English usage at University College London and others. 

Wales anchors her treatment of you in the concept of deixis. The first and second 
person pronouns are fundamentally deictic (as opposed to third person, which is 
anaphoric), for their reference is determined in relation to a specific speaker and 
addressee in a specific time and place of utterance. First and second person pronoun 
referents are thus not fixed or stable but shift according to the situation, as partici¬ 
pants take turns to speak. The choice of first person pronoun or second person 
pronoun is essentially context-dependent. The context can be either the immediate 
situation (specific exophoric reference) or the larger cultural context or presupposed 
knowledge (homophoric). Such instability of reference is what has allowed you to 
expand into the numerous and varied acts of language in which it now occurs. 

As the examples that follow indicate, indefinite you in contemporary American 
English occurs in both fiction and nonfiction writing to convey the feeling of imme¬ 
diacy associated with spoken English. In spoken English, indefinite you allows for 
varying degrees of generalization and personalization, and sometimes the referent of 
you is actually the speaker (first person) not the addressee (second person). You is 
often used in situations in which a speaker conveys new information by appealing to 
a common or reasonable bond of experience between the speaker or topic and the 
hearer. Establishing such rapport seems to be a device in particular of academic dis¬ 
course in order to encourage engagement in learning, and you is frequently observ¬ 
able in explanations of abstract and hypothetical subject matter. You that applies to 
everyone’ is often used in aphorisms. The indefiniteness of you is complete when you 
occurs as the grammatical agent of existential verbs like have and got, yet the use of 
you somehow involves the hearer in the situation. 

Probably the main reason that you with indefinite reference has received little 
analysis is that it seldom occurred in writing. The favored point-of-view of writing 
with a serious purpose has traditionally been the third person, which distances the 
writer and reader from the subject matter. A third person perspective gives an air of 
objectivity and removal from the participants in the communication. It is the person 
most suitable for transmitting facts—one of the major aims of written communica¬ 
tion since the Renaissance. In the formal written register in particular, the tone of 
objectivity was considered compromised by use of first and second person pronouns. 
The formal written register was the school model for correct writing, to the extent 
that writers of academic essays customarily referred to themselves in third person 
with phrases like the present writer. From their earliest days of writing instruction, 
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writers developed a style that avoided the first and second person pronouns except 
in personal writing. The infelicitous I, we, or you that did escape an author’s pen was 
dutifully excised by a careful editor. It is only recently that you has been a feature 
of written English. Its increased frequency corresponds to positive associations with 
informality that developed during the twentieth century. 

Part of the modern feel of twentieth century fiction comes from a point-of-view 
that connects the narrator or main character with the reader and gives the feeling that 
the character in the text is in conversation with the reader. The ambiguous use of you 
(as direct address to a particular reader or readers and as a generalized ‘any person 
that will include any potential reader) is sometimes one vehicle for that connection. 2 

In To have and have not (1937), Ernest Elemingway uses you as a framing device, 
beginning with an ambiguous, questioning you that is an address to a reader who is 
‘anyone’ and who is expected to answer yes. 

(3) You know how it is there early in the morning in Elavana with the bums 
still asleep against the walls of the buildings; before even the ice wagons 
come by with ice for the bars? (1) 

The final sentence of the novel likewise uses you, this time embedded in description 
and weighted towards the generic interpretation. 

(4) A large white yacht was coming into the harbor and seven miles out on the 
horizon you could see a tanker, small and neat in profile against the blue 
sea, hugging the reef as she made to the westward to keep from wasting 
fuel against the stream. (261) 

The framing you of J. D. Salinger’s The catcher in the rye (1951) directly involves the 
reader in the speech act that constitutes the novel—Elolden Caulfield’s telling of his 
story. 

(5) If you really want to hear about it, the first thing you’ll probably want to 
know is where I was born, and what my lousy childhood was like, and how 
my parents were occupied and all before they had me, and all that David 
Copperfield kind of crap, but I don’t feel like going into it, if you want to 
know the truth. (1) 

In the final two sentences, the act of telling takes on universal dimensions with the use 
of an imperative construction (which implies a you subject) and then generic you. 

(6) Don’t ever tell anybody anything. If you do, you start missing everybody. 

(214) 
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In the two novels just cited, you seems to slide between ‘addressee’ and ‘anyone’. In 
other works, you seems not to have an addressing function but a generalizing and at 
the same time personalizing one. An example is seen in (7), from J. K. Rowling’s Harry 
Potter and the sorcerer’s stone (1998). 

(7) Mr. Dursley couldn’t bear people who dressed in funny clothes—the getups 
you saw on young people! (3) 

The involvement of the reader that is achieved by the use of you in fiction can be 
viewed as part of the attempt of authors to incorporate and represent colloquial lan¬ 
guage in writing, to make the relationship between writer and reader more akin to a 
personal, face-to-face relationship between speaker and hearer. 

Instances of you without a specific addressee are frequent and hardly noticeable 
in ordinary oral language use, but until recently it was judged a stylistic flaw for writ¬ 
ten English to include colloquial features, except for fiction and very informal genres. 
Lately, however, writers of good, clear prose about serious topics use you. 

From Thomas Friedman’s From Beirut to Jerusalem (1989), winner of the National 
Book Award, comes (8): 

(8) They say it was a cold, drizzly winter night, the kind that you often find in 
Beirut or Jerusalem, where the combination of wind and rain leaves you 
chilled to the bone and wishing you never had stepped outside. (81) 

The opening page of Robin Lakoff’s Language Wars, published by a prestigious 
university press, contains (9): 

(9) The popular use of “linguist” is very different from its professional accepta¬ 
tion. If you tell a layperson that you’re a “linguist,” you are very likely to be 
asked, “And how many languages do you speak?” (1) 

Feature articles in newspapers often take the you perspective, as in 10 from a fea¬ 
ture about pregnancy and in 11 about the winner of the university’s distinguished ser¬ 
vice award. 

(10) After years of trying, you finally conceive and all you hear from people is, 
“It’s about time.” ([New Orleans] Times-Picayune, 31 July 2000, Ci) 

(11) So why quit work you still love? Why say goodbye to people you will 
miss? ([University of North Carolina at Chapel Hill] University Gazette, 

16 August 2000,7) 

You seldom occurs in news articles or editorials, except in direct quotation. If the 
speech reported between quotation marks in newspapers and magazines is a reliable 
record, many speakers use you where the actual referent is the speaker, I. 
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(12) Referring to Smiths 1997 retirement after 36 years, Guthridge said,“I know 
how Dean felt three years ago. You just are exhausted.” ([Raleigh] News 
and Observer, 1 July 2000, A19) 

(13) [Quoting Jennifer Aniston] “It’s scary how Hollywood treats you like this 
completely different person when you’re thin.” ( McCall’s , June 2000,25) 

(14) It has been an exhausting stretch for Doherty. “At night, you collapse.” 

([Raleigh] News and Observer, 19 July 2000, Ci) 

(15) Why work until you’re 83? For Felton, the answer was simple. “It was just 
fun,” she said. “And besides, what else can you do?” ([University of North 
Carolina at Chapel Hill] University Gazette, 16 August 2000,1) 

(16) “You have a routine when you first hit the beach,” he said. “You cast a 
couple of times off the beach. If you get a few strikes, go in ankle deep. 

But if I don’t have to go any deeper, that’s fine with me.” ([New Orleans] 
Times-Picayune, 2 August 2000, D4) 

Other instances of you in direct quotation in newspapers are the generalizing type, 
referring to ‘any reasonable person in the situation. 

(17) “When you run, you feel better about yourself,” Galloway said. ([New 
Orleans] Times-Picayune, 2 August 2000, D5) 

(18) There may be “one-of-a-kind artifacts” still undiscovered, [Morgan] Per¬ 
kins said, “You’ll find gold chains that no one else has seen.” ([Raleigh] 
News and Observer, 20 July 2000, B8) 

(19) “You download music when you just like that one song off a CD,” said 
Marla [Piece], 15. “You’re not going to buy a whole CD if you only like one 
song.” ([New Orleans] Times-Picayune, 28 July 2000, A10) 

Uses of you like those in (12)—(19) are frequent and ordinary in spoken English. 
The contexts in which indefinite you occurs are often not highly informal, and the use 
of you does not make them so. You is much more than an informal variant of one. It 
allows a speaker to use a personal form to imply that an agent is like anyone, includ¬ 
ing the hearer. Rather than occurring in contexts of trivial conversation, you is often 
the choice in the consultative register when the speaker is trying to make the hearer 
understand something unfamiliar, new, or complex. Examples (2o)-(24) were uttered 
by public figures within the structured but unscripted format of public affairs televi¬ 
sion broadcasts. 

(20) Just because someone would break the law doesn’t mean that you 
shouldn’t make the law. (Steve Forbes, Meet the Press, 23 January 2000) 

(21) It fundamentally changes the way you manage a conflict. (Colin Powell, 
CNN 20 Live, 2 June 2000) 

(22) You need to have a chain of command. (Leon Panetta, ‘Role of the White 
House Chief of Staff’, C-Span, 15 June 2000) 
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(23) We [the Supreme Court] say you can’t have an undue burden. (Linda 
Greenhouse, Washington Week in Review, 30 June 2000) 

(24) [The Russian bear might not be out there in the woods any more, but the 
Chinese dragon is. (Steve Roberts)] How do you run against the Chinese 
dragon? (John Harris, Washington Week in Review, 7 May 1999) 

The Michigan Corpus of Academic Spoken English [micase] (Simpson et al. 1999) 
is a particularly rich source for you in situations in which the transmission of infor¬ 
mation is the goal of the communication, micase makes available on the web writ¬ 
ten transcripts of a variety of academic events at the University of Michigan; the 
current corpus contains twenty-three events, totaling 222,100 words. In the micase 
transcripts are hundreds of occurrences of you not directed to an addressee. Exam¬ 
ples (25)—(28) are drawn from different events. 

(25) The history of science has always been that you have to be first, you have 
to make that discovery first, and that sets up an incredibly competitive 
climate, and atmosphere, (micase, Women in Science Panel, 17 October 
1998) 

(26) You have to be able to put names on the organisms, (micase, Interview 
with Mycologist, 4 January 1996) 

(27) You [pregnant Inuits] have to be airlifted to the hospital, (micase. Medical 
Anthropology Lecture, 25 February 1998) 

(28) On the other hand, if you’re on an island where your primary diet is 
going to be these little insects that bore in, isn’t it great that you have 
this long thin beak? (micase, Psychology 111 Lecture, 22 September 1998) 
[Obviously it is a bird that has the beak, not the students listening to 
the lecture.] 

The tendency of you to widen to refer to ‘everyone’ is shown by the number of 
proverb-like statements that use you, as in 29 and 30. 

(29) You can lead a horse to water but you can’t make him drink. 

(30) You gotta know when to hold 'em and know when to fold 'em. 

In the consultative register, you is often combined with existential have or got, as in 

(31)—(33). In these instances the reference of you has moved as far as possible from the 
particular addressee end of the continuum to generalize as ‘existence’ or ‘topic’. 

(31) You have open borders in Europe now. (Georgie Anne Geyer, Washington 
Week in Review, 11 February 2000) 
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(32) .. .in a lake such as Douglas Lake that stratifies and you have a thermo- 
cline. (micase, Biology and Ecology of Fishes Lecture, 21 July 1998) 

( 33 ) You’ve got three million Palestinian refugees. (Martha Raddatz, Washing¬ 
ton Week in Review, 21 July 2000) 

Sentences (31)—(33) can be paraphrased with there is/ there are in which another 
deictic, there, plays an equivalent grammatical and semantic role to you, (34)—(36). 

(34) There are open borders in Europe now. 

(35) .. .in lakes such as Douglas Lake that stratifies and there is a thermocline. 

(36) There are three million Palestinian refugees. 

The genitive form your, as in (37)—(38) from public affairs television programming, 
requires scrutiny beyond the space limitations of this paper. Because Wales discusses 
your in detail and traces its semantic development to ‘typical member of a class’(i5), I 
shall not treat your further. 

(37) These guys are not your classic teamster thugs. (Glen Simpson, Washing¬ 
ton Week in Review, 19 September 1997) 

(38) Many of your northern species—they travel as far south as North Caro¬ 
lina. (Betsy Bennett, North Carolina People, 31 March 2000) 

In summary, in contemporary American English indefinite you is frequent and 
unobtrusive. Although it occurs more often in spoken English than in written Eng¬ 
lish, its use does not mark an utterance or a text as highly informal, casual, or insig¬ 
nificant. On the contrary, the indefiniteness of reference actually favors the use of you 
in statements of principles of general applicability. In many contexts, despite indefi¬ 
nite reference, the traces of the personal in you help to create an immediacy between 
speaker and hearer that promotes communication. 

As a postscript, I acknowledge choosing the title of this essay more for wit and 
whimsy than for scholarly rectitude. This study does not provide sufficient evidence 
for using the adjective fashionable to describe the rise or current widespread use of 
indefinite you. How, why, when, where, and to what degree indefinite you is increasing 
and whether or not fashion is a factor are matters for further study, which I intend to 
fashion. 


In order to avoid an unnecessarily cumbersome reference section, I have placed biblio¬ 
graphic information needed to locate the numbered examples in parentheses at the end of 
each example. 

I am grateful to my colleague and office neighbor William R. Harmon, author of Macmil¬ 
lan’s A handbook to literature, for the examples from To have and have not and The catcher 
in the rye. I also appreciate the observations of an astute anonymous reviewer that led me 
to rewrite the final paragraph of the essay. 
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nominalizations present many paradoxes: By Zipf’s law, longer words should be 
rare, but in the Longman-Lancaster academic corpus, 4.4% of the words are nominal¬ 
izations (Biber et al 1998:60). Teachers teach against them, but use them extensively in 
their own publications (Hartnett 1998). They abound in bureaucracies, which are now 
trying to avoid them. They played a big role in developing sciences, but new terms in 
particle physics are not nominalizations. Their use maybe declining today. 

Systemic Functional linguists call nominalizations grammatical metaphor (abbre¬ 
viated to G.M.), Whorf called them by the limited term stativations, and some lin¬ 
guists call them non-zero-derived deverbal nouns. I will use the term nominalizations 
now to refer only to nouns made from verbs by changing the root of a verb (usually 
an old irregular one, such as sing/song ) or by adding a suffix other than the gerundive 
-ing, which all verbs have. The term nominalization exemplifies the process. 

To clarify the influences on nominalization, I begin with their patterns of use in 
a language much different from English, Thai. Then I’ll move historically from Greek 
through the Enlightenment to industrialization and its repercussions and then to pat¬ 
terns of nominalization in current publications. I will discuss written styles in two 
advancing modern sciences: neuroscience, which uses traditional heavy nominaliza¬ 
tion for certain audiences, and quantum mechanics, which seems to be moving in the 
opposite direction. This evidence supports the ten factors that I list. 

1 . NOMINALIZATION INCREASES WHEN CHANGING TIMES CAUSE INCREASED DISCUS¬ 
SION OF THEIR TOPICS; IT IS NOT NECESSARILY A CHARACTERISTIC OF THE TOPICS 

themselves. In a study of the 700 years of Standard Thai, Amara Prasithrathsint 
(1997,1994) found nominalizations since its beginnings, carved in stone in 1283. She 
saw that nominalization, usually with prefixes, increased during periods of rapid 
change, such as during the reign of King Rama V (1868-1910) when modernization 
abolished slavery and reformed education and administration. When things settled 
down, nominalization decreased. It increased again during the political revolution 
in the 1930s, when Thailand became a democracy, and later it decreased. Nominal¬ 
izations were not used equally in all fields: psychology used most (23%), then sociol¬ 
ogy (18%), science and philosophy (10% each), statistics (8%), mathematics (7%), and 
medical science (only 5%). She concluded that Thai temporarily increased its use of 
nominalization not because of the topics themselves but because of increased discus¬ 
sion of them. This is the first factor: language changes in times of change. 
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2 . ONE REASON FOR CREATING NOMINALIZATIONS SEEMS TO BE SIMPLE NECESSITY, 
FOR WRITERS WANT TO DISCUSS MORE THAN PEOPLE, PLACES, AND THINGS. A Spurt in 

science and philosophy occurred when ancient Greeks incorporated ideas from civi¬ 
lizations to the south and east. They adapted their verbs by adding morphemes that 
make them mean the actor, the resulting deed or thing done, or the action (Halliday 
1998:197). For example, to theorize about the motion of the earth and stars, Greek 
science created from move its words for mover, then moving, movement, and finally 
motion, which is not an actual entity but is treated as stable, measurable, and able 
to be modified and participate in further processes. Greek science and mathematics 
needed ordered sets of abstract technical terms like these; the need for new terms is 
the second factor. 

3. NOMINALIZATION IS EFFICIENT, ESPECIALLY WHEN TIME, ACTOR, AND PROBABILITY 

are irrelevant. Western science changed little until the period of the Enlighten¬ 
ment, when Newton challenged the prevailing attitude and revolutionized science by 
beginning to base conclusions on observations. In 1704, he used a verb, refracted, first 
and then a nominalization of it, Refraction. 

(1) Light will not be refracted enough, and for want of a sufficient Refraction 
will not converge to the bottom of the Eye. (Halliday 1998:201) 

This sort of nominalization is easy to understand because it grows out of the text, 
a process that Halliday calls logogenesis. It is a natural progression from reporting a 
factual observation of an event to naming the process. A third reason for using nomi¬ 
nalization is its efficiency. 

4. NOMINALIZATIONS ALLOW KNOWLEDGE TO ACCUMULATE BY BUILDING ON PREVI¬ 
OUS INFORMATION, COMPRESSED, ORGANIZED, AND PACKAGED AS A BASE FOR DEVEL¬ 
OPING new knowledge. In the introduction to his ground-breaking work, Elements 
of Chemistry, Lavoisier recognized ‘the impossibility of separating the nomenclature 
of a science from the science itself: ...as ideas are preserved and communicated by 
means of words, it necessarily follows that we cannot improve the language of any 
science without at the same time improving the science itself; neither can we, on 
the other hand, improve a science, without improving the language or nomenclature 
which belongs to if (i79o:xiv-xv). Then when the life sciences developed in the 19th 
century, biologists also needed to define and exemplify as they constructed new tax¬ 
onomies. Nominalizations helped them classify their observations of flora and fauna. 
Chemistry and biology teachers today continue to emphasize the vocabulary of tax¬ 
onomies because they continue to believe, ‘Whoever controls the language controls 
the science’. 

Later, as the social sciences developed, they also needed nominalizations to inter¬ 
pret changes in society. Nominalizations organize known information, freezing it for 
interpretation in theories or hierarchies (Halliday & Martin 1993:199-233). They yield 
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a noun phrase that can be used in a clause to express a great deal more than is explicit. 
Thus they allow a smooth flow of new information when it is re-used as given. The 
fourth factor is the advantage of using previous information as a base for new knowl¬ 
edge. 

5. NOMINALIZATIONS CAN ORGANIZE AND SUMMARIZE INFORMATION RHETORICALLY 
IN WRITTEN COMPARISONS, SEQUENCES, CLASSIFICATIONS, AND MANY TYPES OF GEN¬ 
ERALIZATIONS. Nominalization is most frequent in writing, less common in standard 
speech, and rarest in non-standard varieties. Conversation occurs in real time, usu¬ 
ally face-to-face, and without much time for reflection or for collecting separate ideas 
into generalities or classification systems. In contrast, much writing deals with dif¬ 
ferent topics and is slowly and deliberately created with a different logical structure. 
Nominalization can reorganize expository writing rhetorically, instead of in the real- 
world’s time and location sequence. 

History and literature need nominalizations too, to construct their abstractions 
and generalizations. Although literature needs verbs to get readers interested in the 
story of what happened, later, after thoughtful consideration, conclusions need effi¬ 
cient nominalizations to foreground results. In some situations, separate clauses with¬ 
out nominalization are essential to provide poignancy, but in others, nominalizations 
are required, for full clauses would impede the flow of information (Longacre 1993: 
327). Because nominalizations economically generalize in evaluations, introductions, 
summaries, and conclusions, they appear in many areas beyond academic fields. They 
serve in law, administration, and bureaucracies because of the fifth factor, the ability 
of nominalizations to organize information rhetorically. 

6 . NOMINALIZATION ALLOWS ITS USERS TO BE VAGUE WHEN THAT IS ADVANTAGEOUS 
TO THEM, BUT VAGUENESS DISADVANTAGES SOME AUDIENCES, AND ITS OVERUSE HAS 

led to a backlash. As technology and industry developed, so did management and 
governmental bureaucracies, and they adopted the language of science, gaining its 
prestige. That prestige led bureaucrats to misuse and overuse nominalizations, either 
from habit or carelessness, to make an impression, or to manipulate readers. Some 
lawmakers might have felt that they were safer relying on familiar, traditional word¬ 
ings. As bureaucrats wrote more, they began to talk in their written style. Now, when 
scientists and managers have become so collaborative that they do more talking with 
colleagues than writing, their written and spoken styles are merging, and nominaliza¬ 
tion occurs in both. 

A nominal style can limit the amount of information revealed. It allows speakers to 
generalize without stating or knowing specifics. It also can de-emphasize or hide rel¬ 
evant information. Moreover, although nominalizations turn processes into objects, 
nothing requires treating them objectively. Nominalizations are generalities, and it 
has been said that all generalities are false, including this one. The vagueness of nomi¬ 
nalizations is such a great disadvantage to the audience that a backlash eventually 
developed, opposing the style called gobbledygook. Now our government has issued 
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guidelines and is making an effort toward plain English in documents for the public. 
Many publications, such as insurance policies and health guidelines, have become 
easier for consumers to understand; improvement is possible. The sixth factor is the 
vagueness of nominalizations, which can benefit the users but harm their audiences. 

7. BECAUSE USE OF NOMINALIZATIONS REVEALS AN ASSUMPTION THAT THE AUDIENCE 
UNDERSTANDS THEM, THEY INDICATE GROUP MEMBERSHIP THAT INCLUDES SPEAKER 

and audience. Although technology and business can seem impersonal, nominal- 
ization convinces readers of their understanding and thus establishes the cohesion 
of group identity. Insiders need their own efficient jargon and standard technical ter¬ 
minology, and informed audiences dislike being talked down to. Technical jargon 
creates a field. Because nominalization requires a knowledge of the field, it distin¬ 
guishes the expert from the uninitiated. It indicates status, authority, maturity, and 
other aspects of the writer’s identity (Ivanic 1998:135, Hartnett 1999a). Heavy nomi- 
nalization makes a text sound formal and impersonal, often with a learned touch. 
It can be interpreted as revealing the writer’s assumption that readers understand 
the content, implying group identity and perhaps boosting the egos of the readers. The 
seventh factor is that nominalizations assume understanding and therefore reflect 
the group membership of audience and speaker. 

8. ALTHOUGH THE MEANING OF NOMINALIZATIONS IS INHERENTLY DIFFICULT, IT 
MUST BE MADE CLEAR TO AUDIENCES WITH DIFFERENT NEEDS. Because nominaliza- 

tions assume previous knowledge, they can increase the difficulty of comprehension. 
They leave implicit the experiential meanings that beginning learners most depend 
on (Halliday & Martin 1993:41, 250-57). Nominalizations presuppose so much prior 
knowledge that they are difficult for students to comprehend; this may be why many 
educators dislike them. But unpacking nominalization makes texts longer, although 
still not clear, because it gives all details equal importance and may still hide relation¬ 
ships. Brevity is both advantageous and problematic. 

Context and modifiers may be insufficient to clarify what the nominalizations 
refer to: the actor, action, recipient, resulting product, or more abstractly, the outcome 
or a quality. For example, the words corrosion and classification can mean either pro¬ 
cesses or their products, but information means only the product, not the process. 
Animal protection can mean using guard dogs or saving whales. Meanings can change 
unpredictably. The suffix -er (player, speaker), for example, usually refers to an actor 
or agent, but laughter and prayer are products, not actors. Even a native reader cannot 
deduce meaning simply from the form. A conception can be an idea, its beginning, a 
biological process, or its result. 

Furthermore, the original speakers and writers have their own intended meanings 
before they express them, and so the meanings of their nominalizations are ‘never 
simply additive’, but are more specific and applicable to a narrower field (Bolinger 
1968:55). When scientists try to avoid creating too many new words, they may slant, 
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Publication, Date (1998), 
Price, and Number of 
Articles 

Sentences 

in First 
Paragraphs 

Total of 

Finite 

Verbs 

Total 

Nominal- 

izations 

Nominal- 

izations 

as % of 
Verbs 

Science: May 8, $7,18 





• Specialist Reports 

117 

193 

178 

92% 

• Non-Specialist 

64 

125 

55 

44% 

Reports 





Modern Maturity, 

68 

125 

57 

46% 

Mar-Apr, $.40,19 





Popular Mechanics, 

79 

122 

52 

43 % 

June, $2.95,18 





PC World, June, $2.95,18 

37 

64 

27 

42% 

Family PC, June, $1.99,9 

28 

66 

28 

42% 

Essence, June, $2.75,8 

100 

89 

28 

31% 
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Specialized 

Non-Specialized 

Total Words Analyzed 

687 

681 

Sentences 

35 

31 

Complete Finite Verbs 

67 

80 

All Nouns 

184 

174 

Nominalizations 

41 

25 

Prepositional Phrases 

84 

66 

• Objects of Prepositions that 



are Nominalizations 

18 

3 

• Sequences of 2 or more PPs 

19 

11 

• Nominalizations in Sequences 



of 2 or more PPs 

7 

1 


Table 2. Analysis of two science reports of the same research. 

science magazines (PC World, Family PC, and Popular Science), near the 46% in 
Modern Maturity for readers over age fifty but way above the 31% in the extensively 
illustrated beauty magazine, Essence. 

When scientists write for other scientists, they usually try to be convincing by stay¬ 
ing within the paradigm, but science writing for the public focuses on novelty (Fahn¬ 
estock 1986). Therefore I examined reports for two different audiences concerning the 
same brain surgery (Baynes et al. and Strauss, both in Science 8 May 1998) to compare 
the styles and contents appropriate for different audiences and purposes. 

To make the specialist report as similar as possible in content and size to the article 
for educated non-specialist readers, I omitted much of the lengthy narrative of proce¬ 
dures and test results from the middle of the specialist report (Hartnett 1999b, 1999c). 
Table 2 reflects the more nominal style of the article for specialists. Neither article had 
many nominalizations that are technical terms (such as lateralization, 10 total), for 
most of the technical terms were adjectives, verbs, or ordinary nouns used in a special 
sense, such as hemisphere. Most of the nominalizations in this very limited sample 
were clear in context, and only one related to previous use as a verb. 

Although Table 2 shows that the specialized material has only 5% more nouns (184 
vs. 174), more of them are nominalizations (41 vs. 25, which equals 22% vs. 14% of all 
the nouns and amounts to 1.2 vs. 0.8 nouns per sentence and twice as many nominal¬ 
izations per verb, .62 vs. .31). 

Differences in the syntactic roles of nominalizations also are significant (p = .01), 
for the report for specialists has more prepositional phrases (84 vs. 66), and six 
times as many nominalizations in prepositional phrases as the parallel report for 
non-specialists (18 vs. 3). In the specialist report more prepositional phrases occur 
in sequences of two or more: the 19 sequences there include seven nominalizations, 
while the 11 sequences in the non-specialized material used only one nominalization. 

Prepositional phrases pack more lexical content into clauses. This heavy modifica¬ 
tion also simplifies the syntactic complexity without reducing the lexical density (the 
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ratio of content words to grammatical terms). In spoken English, lexical density is 
only about two lexical words per clause (Halliday 1998: 207). In contrast, both of the 
science articles that I examined have very high lexical densities of 10.8 content words 
per average sentence of 20.7 words, for they both are written for scientifically trained 
readers. In both, 52% of their words serve as nouns, verbs, adjectives, or adverbs, and 
48% work as prepositions, conjunctions, articles, or modal auxiliaries of verbs. Heav¬ 
ily modified nominalizations appear in simple sentence structures for the specialists 
who know the vocabulary, while other readers get fewer nominalizations but more 
complex syntax. These differences illustrate the eighth factor, that lexical and syntactic 
choices vary to accommodate the needs of the readers as well as the purposes of the 
writer. 

9. BECAUSE NEW INFORMATION IS EASIER TO COMPREHEND AT THE END OF SEN¬ 
TENCES, NOMINALIZATIONS THAT PRESENT NEW INFORMATION AT THE BEGINNING OF 
SENTENCES ARE DIFFICULT TO COMPREHEND AND IMPOSSIBLE TO QUESTION. Syntac¬ 
tic roles influence where information appears in the sentence. New information is 
easier to comprehend after its context introduces it, so it reads better not as subject 
or theme but in the new information position toward the end. The beginnings of the 
sentences in the two Science reports differed significantly, as illustrated in the struc¬ 
tures of the fourth sentences of each report. 

(2) By studying an epileptic patient whose brain was surgically divided to 
control her seizures, Kathleen Baynes, a cognitive neuroscientist at the 
University of California, Davis, and her colleagues found that the centers 
for speech and writing, long thought to be in the same side of the brain, 
can reside in different hemispheres. 

(3) Examination of split-brain patients both before and after their surgery 
is another tool for the unmasking of such processes. [Author’s note: the 
processes were described two sentences earlier ;] 

In the report for non-specialists, the fourth sentence, (2), has fifty-two words; it 
begins with fourteen words describing the method with a prepositional phrase whose 
object is described in a passive clause; then it names the researcher as grammatical 
subject. After identifying her, her institution, and her co-researchers, it continues with 
the main verb followed by a clause whose subject is followed by a prepositional phrase 
and a participial modifier with two more prepositional phrases before the verb and 
another prepositional phrase. 

In the specialized article, the nineteen-word fourth sentence, (3), introduces the 
findings with a nominalization as subject. This sentence exemplifies a sentence struc¬ 
ture that is common in science writing: two heavily modified noun phrases, often 
nominalizations, connected with a verb that relates them. If the specialized example 
sentence were stripped of the fifteen words in its four prepositional phrases, it would 
read simply and insignificantly. 



110 


CAROLYN G. HARTNETT 


(4) Examination... is another tool... 

In its entirety, this sentence illustrates how nominalizations attract modifiers of all 
types, especially noun adjuncts and prepositional phrases, in a relatively simple sen¬ 
tence structure. In contrast, the nominalizations for non-specialists in (2) are the 
unmodified common words seizures and speech; both are used as objects, not subjects. 
These sentences reflect the importance of syntax and word order in the ninth factor, 
balancing the difficulties of comprehending nominalizations that present new infor¬ 
mation at the beginning of a sentence. 

10. ALTHOUGH WE ARE ACCUSTOMED TO INTERPRETING THE UNIVERSE AS A PLACE 
WHERE THINGS RELATE TO OTHER THINGS, AND NOMINALIZATION INTERPRETS OR 
RECONSTRUCTS IT AS SUCH, WHEN OUR VIEW CHANGES, LANGUAGE MUST CHANGE, 

but nominalizations are not the only alternative. Nominalizations gave sci¬ 
ence the advantage of discussing processes and relationships as things that are treated 
as stable objects and as the basis for further knowledge and theory. Now we have a 
paradox in the science with the greatest changes in the last century. Although the 
language of neuroscience and most other sciences reflects a fixed, determinate real¬ 
ity of objects to be observed and classified, physics moved ‘from object to process, 
from absolute to relative, from determinate to probabilistic, from stability to flow’ 
(Halliday & Martin 1993:20). Nobel Prize-winner Steven Weinberg writes,‘quantum 
mechanics has transformed the very language we use to describe nature: in place 
of particles with definite positions and velocities, we have learned to speak of wave 
functions and probabilities... matter has lost its central role’ (1992:3). How can physi¬ 
cists use nouns when ‘matter has lost its central role’? They describe basic particles as 
vibrating strings of positive and negative fields of energy. They accept the principle 
that their knowledge is always incomplete, which follows from the Heisenberg prin¬ 
ciple of not being able to observe something without changing it (such as the speed 
and location of a particle). 

Quantum physicists see order in the universe but accept flux and use processes 
that they call fuzzy logic and chaos theory. Instead of creating new nominalizations for 
processes or their products, they use humor to name the characteristics of particles. 
They discuss anti-bottom quarks with flavors of charm and strangeness. Some of the 
whimsical new terms are short combinations of morphemes, such as gluons that hold 
quarks together. Quantum chromodynamics classifies with the names of nonexistent 
colors (Gribbin 1998). These are the formal terms used in specialists’ reports, terms 
defined seriously in dictionaries and labeled not colloquial but physics. Elsewhere, 
imprecision is becoming acceptable. Statistical sampling is considered more accurate 
than an actual census count. A tendency toward imprecision might be a temporary 
stage for teenagers who go Tike uhm, y’know, well, anyway, like, so, y’know, whatever. 
It’s awesome’. 

Our grammar construes a universe of things and relations, where objects relate 
to or act on other objects. Nominalization interprets the world this way (Halliday & 
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Martin 1993:220). Our grammar imposes grammatical categories, so when the things 
are events, we adjust by turning verbs into nouns. However, our view can change and 
make us change the categorization. The tenth factor is our changing view of the uni¬ 
verse, which in turn changes our language. 

Another change seems to be developing, making science more commonplace and 
democratic. Scientists are human beings who need public support for funding 
and who enjoy having non-scientist friends understand or at least appreciate their 
work. Halliday has predicted a return to the more democratic use of verbs, and he 
has also reminded us of the varying categories in non-Indo-European languages 
(1998:186-88). Language is changing, but not necessarily increasing nominalization. 
Linguists cannot tell an advancing science what kind of language to use, but they can 
observe it. 
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introduction. Lexicalization is a process whereby new lexemes are formed. The 
process is usually seen as historical, and the resultant lexeme becomes a synchronic 
unit. A certain degree of componential opacity is associated with a fully lexicalized 
item. (Heine et al. 1991:95) To the extent that speakers still view a lexeme as composed 
of its parts, the lexicalization process may be considered less than complete within a 
functional theoretical framework. Such an analysis is consonant with certain typolo¬ 
gies, but flies in the face of synchronic evidence in languages where componential 
transparency is the norm rather than the exception. 

For example, it is well known that Chinese has compound lexemes (Li & Thomp¬ 
son 1998). However, there is evidence from brain damaged patients that a compo¬ 
nents status as verbal or nominal can affect access to same within a fully lexicalized 
compound. So Mandarin ugfg chi fan ‘eat’, while enjoying full lexeme status, is more 
likely to be retrieved as just §§/«« rice’ by patients with damage to Broca’s area, and 
as n£ chi ‘eat’ alone by those suffering from damage to Wernicke’s area (Bates & Chen 
1991; Zhang ms.). On the other hand, speakers of English often have difficulty access¬ 
ing the componential makeup of words whose root morphemes have independent 
lexemic status: e.g., rooster [< roost + er]. In Hebrew most morphemes are bound, 
because consonants carry mostly lexico-semantic information, while vowels code 
mostly grammatical and categorization data. Despite the highly dependent nature of 
Hebrew morphology, speakers find it extremely transparent. 

The purpose of this paper is to determine the implications for lexicalization from 
the relative opacity or transparency of morphemes in typologically distinct languages. 
Lexicality will be defined independently from componential opacity and morpholog¬ 
ical dependence. Opacity will be subcategorized as circumstantial (objective) versus 
psychological (subjective). Correlations between and among opacity, dependence and 
lexicality will then be tracked. 

This paper will use examples from Hebrew, English and Chinese to determine how 
overall systematicity in lexical derivation affects speakers’ strategies in processing lex¬ 
emes. The actual transparency/opacity of a particular item may be seen to be less 
important than whether a speaker expects lexemes to be componentially transpar¬ 
ent. We will consider whether systematicity in derivation primes a subject to use 
componential analysis even where it is less productive. We will explore whether lack 
of systematic derivation overall deters a speaker from using a strategy of componen¬ 
tial analysis even on lexemes whose components are well known to the speaker. The 
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psychological underpinnings of opacity will be explored within a framework of typo¬ 
logical comparison. 

l. the theoretical background. Lexicalization, as a linguistic label, emerges from 
the functional linguistic school of thought, and it is often paired with the term gram- 
maticalization. (Heine et al. 1991; Hopper & Traugott 1993; Traugott & Heine 1991). 
As historical processes, both lexicalization and grammaticalization often result from 
reanalysis. However, while grammaticalization gives rise to new productive patterns 
that add to the grammar of a language, lexicalization produces seemingly isolated 
additions to the lexicon. Regularity, productivity, and transparency as components 
are therefore attributed to grammaticalized items, whereas idiosyncracy, randomness 
and opacity are expected of lexicalized words. 

According to Heine et al. (1991:95), the difference between the two terms can be 
summed up as follows: Assuming that both involve some kind of ‘idiomization’, the 
latter may be said to be morphologically productive in the case of grammaticalization 
but not in that of lexicalization.’ 

The above is representative of statements about lexicalization in grammaticaliza¬ 
tion literature as a whole, and it carries with it some subtle implications that are worth 
exploring. There is also an almost imperceptible shift between a diachronic use of the 
term to a synchronic use that has far reaching implications. An item is often seen to be 
more lexicalized if its componential makeup is relatively opaque, and less lexicalized 
if it is relatively transparent. This implies that full lexemic status ought to be accorded 
only to items whose derivation is synchronically unreadable. If we followed such logic 
to its ultimate conclusions, we would decide that a monomorphemic word is the most 
lexemic of all words. However, there are serious drawbacks to this approach when 
applied crosslinguistically to languages belonging to contrasting typologies. 

In a language such as Hebrew, there are almost no monomorphemic words, since 
vowels and consonants play complementary roles in word formation. The majority 
of vowels found in a Hebrew word code derivational and inflectional contrasts, while 
a majority of consonants code lexico-semantic information. Morphemes are discon¬ 
tinuous, and every syllable carries parts of more than one morpheme. But despite the 
extremely fusional nature of the language, componential transparency is the norm, 
rather than the exception. 

In fact, in Modern Hebrew, non-linear derivation patterns, involving the use of 
discontinuous roots and templates to form new words, are still extremely productive. 
(Bolozky 1999). 

Mandarin is known as an isolating language, but it has a large stock of complex lex¬ 
emes. (Li & Thompson 1998). The dependence level of these lexemes is low, in that they 
are not phonologically affixed and there is no reduction or resyllabification involved. 
In this sense, the morphologically complex lexemes of Mandarin are quite different 
from those of Hebrew. But both Mandarin and Hebrew enjoy a high level of morpho¬ 
logical transparency. There is evidence from brain damaged speakers of Mandarin that 
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components are accessed according to their grammatical categorization, rather than 
the grammatical category of the word as a whole (Bates & Chen 1991). 

Multimorphemic words in English are more tightly fused than those in Mandarin, 
but less so than in Hebrew. Multisyllabic lexemes in English are common, but there is 
by no means a one-to-one correspondence between morpheme and syllable. Speakers 
of English have great difficulty analyzing lexemes into their component morphemes. 
Even in such common words as heal and healthy, speakers of the language require 
instruction in its history in order to identify the common root, according to the intro¬ 
duction to a leading textbook on the history of English (Pyles & Algeo 1993.2) In 
other words, commonly used English lexemes are often componentially opaque for 
the average speaker. 

The implications for the concept of lexicalization from the above observations of 
Hebrew, Mandarin and English are as follows: 

(1) Lexical status is not necessarily dependent on degree of fusion. 

(2) Lexical status is not necessarily dependent on opacity or semantic bleaching 
of components. 

(3) Opacity and fusion are independent of each other. 

The significance of these initial observations is as follows: if lexical status can be 
established independently of both fusion and opacity, then lexeme formation (i.e., 
lexicalization) cannot be defined as a necessarily‘morphologically unproductive’. 

Heine et alls statement (1991:95) might suggest that componential opacity is some¬ 
thing that indicates a high level of lexicalization, where the term lexicalization is 
defined as incremental. The higher the opacity, the higher the degree of lexicalization. 
But if such an interpretation were used, most words in a language such as English 
might be judged to be more lexicalized than those in Hebrew and Mandarin. What 
would be the implications of such a finding? 

One possibility would be to suppose the individual words have undergone a 
greater process of semantic erosion and that opacity results from such bleaching. But 
an alternative explanation is available: it is not so much that individual words have 
had their morphemes bleached, but rather that the speakers of the language, due to 
lack of overall systematicity in lexical patterning, have been conditioned to ignore 
componential analysis as a useful strategy. 

This paper proposes to probe the issue by comparing psychological componential 
opacity (the psychological ability of speakers to analyze components) with circum¬ 
stantial componential opacity (whether the requisite cues are synchronically there in 
the language to allow for such analysis). 

2 . PSYCHOLOGICAL COMPONENTIAL OPACITY VERSUS CIRCUMSTANTIAL COMPONEN¬ 
TIAL opacity. An extreme example of circumstantially opaque derivation in English 
is hussy [<housewife ]. (Pyles & Algeo 1993:265) The phonological reduction and 
fusion that occurred over time have rendered the lexeme morphologically opaque 
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and therefore monomorphemic. Speakers of English cannot identify the component 
parts of the word, because no one could do so without knowing its history. The infor¬ 
mation is simply not available synchronically within the language as a whole. There¬ 
fore the opacity is circumstantial, not psychological. 

An example of psychological opacity in English can be found in the lexeme rooster 
[<roost +er]. Here, the morpheme roost has separate lexemic status, and the suffix -er 
is productive throughout the language. Speakers synchronically have enough infor¬ 
mation to easily analyze the word. If they fail to recognize its components, this is due 
to psychological opacity. (That many do fail to recognize the components has been 
related to me by a number of native speakers of English who first learned of this 
word’s composition in a morphology class.) 

The methodology proposed has at its core the comparison of psychological versus 
circumstantial opacity. If a lexical item is both circumstantially and psychologically 
opaque, there is nothing surprising in such a situation. Likewise, if a lexeme is both 
circumstantially and psychologically transparent, there is nothing unusual about that. 
However, if there are lexemes that are psychologically opaque but circumstantially 
transparent, such a finding would have theoretical significance. Moreover, if a lexical 
item is circumstantially opaque but psychologically transparent, this would be a very 
interesting find. 

3. PROPER NAMES AND THEIR IMPLICATIONS FOR LANGUAGE-WIDE COMPONENTIAL 

transparency. It might be supposed that opacity or transparency of lexemes would 
be largely determined by the necessity of engaging in creative acts of word formation. 
Since most speakers seldom have occasion to coin new words, their skills in the reverse 
process of analyzing the words with which they are presented might be atrophied. 

The generative framework, for instance, equates creativity with the ability to generate 
novel utterances. (Chomsky 1966). The focus is on the speaker, not the hearer. Is trans¬ 
parency directly related to productivity? Some of the literature might suggest this. 

Bolozky (i999:xi) tells us in his preface: ‘Somewhere in the heart of every speaker 
of Israeli Hebrew, placed between a love of oriental food and a passion for politics, 
there is a constant urge to form new words’. He then proceeds to test productivity by 
a series of experiments in which speakers are requested to coin new words, and the 
templates and roots which they use are analyzed for productivity. 

The approach of this paper is quite the reverse. It is speakers’ comprehension of the 
componential makeup of commonly used lexemes that concerns us here. Neverthe¬ 
less, the one time when most ordinary people have occasion to form a word is in the 
process of naming. A proper name is the most extreme form of lexicalization avail¬ 
able. Since many speakers do engage in naming, they must also have some skill in 
deducing the meanings of names given by others. A brief examination is in order, 
in the spirit of a crosslinguistic comparison. 

The following quoted material is of a non-academic nature, and it is presented not 
in order to prove any linguistic assertions contained therein, but to demonstrate the 
attitudes toward names displayed by ordinary lay people. In a short article, intended 
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for Chinese speaking students of English, Oscar Chiang explains: ‘The Chinese are 
more particular than most Americans when choosing personal names. Chinese 
names often have special meanings, while most American names do not. Chinese par¬ 
ents or grandparents usually select names for their children or grandchildren with 
great care in order to create a certain expectation or ideal for the child to follow. On 
the other hand, American parents adopt the names of relatives (and even friends) for 
their children, as a way of honoring that person’. (Chiang 1999). 

The facts on which the above lay perception is based are as follows: most names 
used by English speakers are not English words. Most are borrowings from other lan¬ 
guages, and even many whose origin is Anglo-Saxon are not synchronically recogniz¬ 
able English words. As a result, English speakers have been conditioned to think of 
names as semantically empty labels. The attitude is so systematic for English speakers 
that it colors their perception even of names that are circumstantially transparent in 
modern day English. 

There is evidence that at least some speakers of English, while being perfectly 
aware of the meaning of the word rose, appear to be concurrently unaware that the 
name Rose also means ‘rose’. 

In a web search of name dictionaries, one such dictionary listed the name Raisa 
with the definition ‘rose’. The name Rose, however, was defined as: ‘Of the Rose Blos¬ 
som’. In other words, the English speakers who compiled that name dictionary were 
not comfortable with having a name mean the same as a common word with which 
it is identical. At most, for many speakers of English, a name can have an etymology, 
but not a lexical meaning. 

Since ‘meaning’ cannot be prescribed, and native speakers of a language are the 
only ones who determine the meaning of words in their language, another way to 
express this is to say that the name Rose, in the minds of at least some speakers of 
English, has no meaning directly associated with that of the common noun rose. 

This assertion may seem slightly controversial, since many English speakers who 
are linguists have etymologically based perceptions of lexical meaning associated 
with word origin. Many literate speakers of English are aware that names have 
‘meanings’—without knowing what those meanings are. The meanings form a set of 
arcane knowledge. English speakers often consult name dictionaries to find out what 
their names ‘mean. What they are really looking up is etymologies, true or fabricated. 

The same desystemetizing influence that we find in the proper noun inventory of 
English speakers can also be found in the list of common nouns. Borrowed words in 
English have elaborate morphology which is fairly systematic, but unintelligible to the 
untrained speaker. Most Germanic common nouns in English are monomorphemic. 
Barnyard animals make for a good subgroup: cow, horse, dog, cat, duck, goose, hen, cock. 
Words such as chicken and rooster are in fact bimorphemic— chick-en, roost-er. How¬ 
ever, due to the lack of systematicity in the language, many speakers do not interpret 
them as such. A circumstantially transparent word such as rooster becomes psychologi¬ 
cally opaque to the average English speaker, perhaps because of lack of priming. 
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4. componential priming in Chinese. In the case of Chinese the morphological com¬ 
ponents of a multimorphemic lexeme are not fused, by reason of language typology 
The phonological independence of each morpheme has led scholars to suppose in the 
past that each syllable constituted a separate lexeme. It is by now well established that 
phonological word boundaries and lexemic word boundaries are not necessarily coter¬ 
minous. (Li & Thompson 1998). Thus, a word such as ifetS chi fan ‘eat’ is a single unit 
for purposes of lexical insertion into a syntactic pattern, despite the phonological inde¬ 
pendence of its components, vfc chi eat’ and Wtfan ‘rice’. Nonetheless, psycholinguistic 
evidence shows that the component morphemes continue to enjoy psychological trans¬ 
parency, despite the lexical status of the compound. (Bates & Chen 1991). 

This componential make-up of a majority of Chinese lexemes, both nouns and 
verbs, is systematic in terms of being widespread, but not in terms of generativity. This 
is derivation, not inflection. We can explain how a word is derived after the fact, but 
we cannot absolutely predict what derivational paths will be followed. In that sense, 
there is no anticipatory system for speakers to follow. The patterns are both transpar¬ 
ent and productive, but not systematically generative. 

Nonetheless, derivation in Chinese is by and large open and accessible to the aver¬ 
age speaker. Relative to English, Chinese derivational morphology enjoys a high level 
of psychological transparency. 

This does not, however, mean that every derivation is equally transparent. For 
instance, the word for China, zhong hua, is composed of two morphemes. 
Speakers will readily explain that 4 1 zhong means middle, and that China is so called 
because it is at the center of the world. But the second morpheme hua is not as 
accessible. When questioned about it, speakers look confused and offer a variety of 
responses, but most admit they are not sure what it means. 

As it turns out, the reason for the relative opacity of T hua directly correlates with 
word frequency. The word T zhong is a common word in Modern Mandarin, and 
hence compound words derived from it are also transparent in terms of the meaning 
of that component of the compound. The word ^ hua is somewhat archaic. It appears 
in compounds and especially in names, but its more literal meaning is bleached and 
obscured. Hence, the opacity of the component morpheme is entirely circumstantial. 
Speakers do not have synchronic access to the word ^ hua, and hence they do not 
know what it contributes semantically to the compound. 

The circumstantial opacity described above contrasts nicely with the circumstantial 
transparency but psychological opacity that English speakers experience with the 
roost- of rooster. Chinese speakers do not know what hua means, so they also do 
not know what it contributes to TH zhong hua. But there are English speakers who 
do know what roost means, yet do not recognize it when it serves as the root for a 
morphologically complex word. 

The non-fusional nature of Chinese phonology may be credited with the ease with 
which speakers analyze the component morphemes of their multimorphemic words. 
However, this is not the case in Hebrew, where componential transparency is also the 
norm, despite the widespread use of discontinuous morphemes. 
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5. HEBREW MORPHOLOGICAL TRANSPARENCY VS. PHONOLOGICAL OPACITY. In the Case 

of Hebrew, derivational patterns are so predictable and the paradigms are so wide¬ 
spread that one can almost assert that Hebrew derivation is not only systematic, it is 
downright generative! An average speaker can be given the task of generating a new 
lexeme using a particular template and root, and in as little time as it takes to conju¬ 
gate a verb, he can produce a new lexeme. (Bolozky i999:xi). 

Hebrew derivation is not as predictable as Hebrew inflection, of course, but it 
comes very close. The wide gap between inflection and derivation that European lan¬ 
guages typically display is not present in a Semitic language such as Hebrew. 

It is well established that Hebrew derivational morphology is extremely productive 
and transparent to speakers (Bolozky 1999; Ephratt 1997). Therefore, in dealing with 
Hebrew, this paper will focus not on the question of psychological opacity, but rather 
on circumstantial opacity. It has been generally assumed that transparency in Hebrew 
morphology is directly correlated with systematic productivity. The question remains: 
how does the psychological transparency of a particular lexical item’s components 
survive phonological changes that create circumstantial opacity? Is there circumstan¬ 
tial opacity in current day Israeli Hebrew? 

Hebrew phonology has historically had the following stop phonemes /bdg ptk/ 
which were realized as their corresponding fricatives post-vocalically, and as stops 
everywhere else. There has been some erosion of this rule since the revival of Hebrew, 
in that It/, Id/, and Igl are never realized as fricatives regardless of environment. 
Recently there has also been much inconsistency in the realization of /b/, /p/ and /k/ 
as stops or fricatives. Productively, the stops are not fricativized, but they appear as 
fricatives in many frozen expressions, including many derived lexemes. This is a situ¬ 
ation which ordinarily leads to merger of phonemes and a high level of homonymy 
among roots. The expected result of such changes is circumstantial opacity of roots. 

Thus a lexeme such as /miknasajim/, meaning ‘pants’, realized as [mixnasajim], 
may not be circumstantially interpreted as being related to a word such as /knesset/, 
meaning ‘parliament’, realized as /kneset/, even though both historically come from 
the root /k-n-s/, meaning ‘to enter’. 

The approach to circumstantial opacity adopted here is somewhat akin to the issue 
of biuniqueness in phonology. Certain phonological theories assume a top down 
approach, looking at phonemic status from the point of view of the speaker. Requir¬ 
ing biuniqueness in a phonology, on the other hand, means looking at phonemic 
status from the point of view of the hearer. If the hearer has no way to determine from 
the sounds whether the [x] of [mixnasajim] is a het or a kajf, then under a biunique 
theory of phonology, all the [x] sounds have merged into a single phoneme, and the 
words derived from the root /k-n-s/ will also be split according to whether the kaff in 
a particular derivation is postvocalic or postconsonantal. 

In terms of psychological transparency, such a split has not occurred. Speakers 
still experience the root /k-n-s/ as an accessible morpheme. They are able to generate 
new lexemes using this root and familiar templates, even though they may falter in 
deciding whether to realize the Ik/ as a [x] or a [k]. Likewise, when they encounter 
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an established lexeme, the root /k-n-s/ has psychological reality for them, despite the 
circumstantial opacity created by phonological change. In fact, it is well known that 
Hebrew displays a concurrent semantic transparency in the face of phonological opac¬ 
ity (Barkai 1978,1980). Speakers know what a root means and recognize its occurrence, 
even if they are a little fuzzy as to the phonemes of which it consists. Circumstantial 
opacity does not prevent them from experiencing psychological transparency. 

The apparent implication for lexicalization is this: lexical status is not dependent 
on componential opacity. In turn, componential opacity or transparency seems to be 
a systemwide psychological phenomenon. Opacity is not necessarily brought about by 
the reanalysis of phonological boundaries. Sometimes it is a change in the entire system 
that renders a particular lexical derivation opaque, despite the circumstantial accessi¬ 
bility of its component parts. Speakers can perhaps tolerate a high level of circumstan¬ 
tial opacity, if derivation in the language as a whole is systematic and predictable. 

6. conclusion. Functional research in grammaticalization has equated lexicalization 
with low productivity, fusion and componential opacity. This paper has explored the 
possibility, using Chinese, Hebrew and English data, that circumstantial opacity and 
psychological opacity do not always correlate. We have seen that a lexeme can be 
psychologically transparent while circumstantially opaque (Hebrew), and conversely, 
that a lexical item can be circumstantially transparent but psychologically opaque 
(English). Likewise, Chinese morphological components of compound words were 
found to behave sometimes as more transparent and sometimes less, without regard 
to phonological fusion. Thus, the idea that fusion or opacity are predictable correlates 
of lexemic status has been shown to be questionable. 

Lexicalization, as a process, can occur as a result of reanalysis, but reanalysis itself 
does not necessarily lead to componential opacity. The opacity or transparency of the 
components of a word may be a psychological effect brought about by language-wide 
phenomena. Perhaps speakers are conditioned by the language as a whole to use com¬ 
ponential analysis as a productive strategy or to fail to make such use. 

The above observations are preliminary in nature. I am currently working on an 
experimental research project funded by the National Science Council of Taiwan. My 
investigation of componential transparency and opacity is ongoing. Analysis of the 
data I am currently collecting may eventually support or disprove the initial hypoth¬ 
eses set forth in this paper. 
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AN OPERATOR-BASED INTEGRATION OF 
COMPREHENSION AND GENERATION 
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This paper 1 discusses the theoretical and practical implications of using an 
operator-based framework for modeling natural-language comprehension and pro¬ 
duction processes. In particular, it presents aspects of the NL-Soar cognitive model¬ 
ing system that relate to these questions. It surveys the cognitive theory and modeling 
system that frame both types of processing, linguistic aspects of the comprehen¬ 
sion system and its relevant operators, and the generation system with its own oper¬ 
ators and processing stages. The degree to which these two systems exhibit shared 
functionality is then explored, and ongoing work is mentioned. 

1. background. Extensive study has been carried out on the capabilities and short¬ 
comings inherent in human processing of natural language (e.g., Yngve i960). Psy- 
cholinguistic evidence has pointed to observable constraints on complexity and 
capacity; for example, many types of grammatically correct but unparsable sentences 
have been described. This has led to discussions about characteristics of the human 
mind that may play a constraining role in our ability to process language, focusing 
on such issues as modularity, input/output buffering, working memory size (Gibson 
1991; Lewis 1996), parsing error recovery (Pritchett 1992), underlying processes (Gile 
1990), and capacity constraints (Just & Carpenter 1992). On the other hand, many 
computer systems that have been developed to carry out language tasks such as pars¬ 
ing and generation are based on computer architectures that do not always reflect 
the properties often ascribed to human-based processing of language. Whereas the 
cognitive plausibility of such systems is not a direct issue in this paper, the underly¬ 
ing assumption is that grounding a generation system in a more general theory of 
human cognition will lead naturally to an integrated framework where comprehen¬ 
sion and generation are both able to draw upon processes, goals, and linguistic knowl¬ 
edge structures that are maximally related. The NL-Soar system, as sketched in this 
paper, responds to these criteria. 

Describing the exercise of human intelligence including (but not limited to) lan¬ 
guage, deliberation, action, and performance within one general framework, the 
mind, is the goal of those seeking to establish a unified theory of cognition (Newell 
1990; Anderson 1993). Far from being simply a theoretical undertaking, though, this 
work also seeks to instantiate the approaches as working computational systems in 
order to demonstrate and explore assumptions about knowledge sources, behavior 
timelines, resources, and observable mechanisms and processes. 
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Soar (Newell 1990) is one system that has been used for widely varied cognitive 
modeling tasks such as game-playing, air-traffic control, and automobile driving. The 
natural-language processing instantiation of the general Soar architecture is called 
NL-Soar (Lewis 1993); NL-Soar is thus meant to be the language cognition subcom¬ 
ponent of the general Soar theory. It has been successfully integrated with Soar sys¬ 
tems designed to carry out other tasks (Rubinoff & Lehman 1994a; Lehman et al. 1995; 
VanDyke & Lehman 1997). After a brief introduction to NL-Soar comprehension, the 
current generation system (which was developed by the author) is sketched, and their 
integration is then explored. 

2. nl-soar comprehension. NL-Soar comprehension was the first comprehensive 
linguistic component based on the Soar system (Lehman et al. 1991). It directly 
implements Soar's basic characteristics: a production-rule knowledge base, an agent- 
oriented infrastructure, a specifiable goal hierarchy, problem spaces that encapsulate 
and delimit working subsets of memory, and learning via chunked behaviors. It was 
designed to model the performance of humans while comprehending difficult utter¬ 
ances with multiple embeddings and garden-path sentences; its behavior was shown 
to closely model that of humans for problematic and unproblematic constructions. 
More details about Soar and the NL-Soar comprehension system can be found in 
(Laird et al. 1993) and (Lewis 1993) respectively. 

The NL-Soar agent receives textual input sentences, tokenizes them, and stores the 
resulting words in a temporary buffer that is subject to decay. Words are attended to 
one-by-one and processed by the system. Processing involves accessing the lexicon 
for information about each individual word, and then incorporating the word into 
different types of linguistic structure. 

Two different but related linguistic structures are managed by the system. A 
phrase-structure tree representation called a syntactic model reflects the surface syn¬ 
tactic representation of a sentence. It is based on X-bar syntax and reflects constraints 
such as those operant in the distribution of topicalized constituents, empty categories, 
and movement processes involved in inflection. (Chomsky 1986). The semantic repre¬ 
sentation currently implemented in the system is based on Lexical-Conceptual Struc¬ 
ture (LCS) (Jackendoff 1990) and relates actions, states, processes with concepts and 
their attributes. This structure, called the semantic model, is meant to abstract away 
from morphosyntactic levels of representation. The use of other semantic representa¬ 
tion systems is possible, and in fact others have been used in the past (Newell 1988). 

Figure 1 shows syntactic and semantic models that the NL-Soar comprehension 
component creates upon processing the first part of the input sentence‘The doctor who 
called works at a hospital’. To begin with, after its introduction to the system the word 
the is recognized as a determiner; after the word doctor is processed a NP is formed 
and the semantic concept doctor, a Thing, is spawned. After the next word who enters 
the system, a CP complement is generated for the noun in the syntactic model. After 
processing the verb called, a verb phrase for the embedded clauses subject is created 
in the syntax, and the appropriate Action node in the semantic model is created and 
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Figure 1. Syntactic and semantic models incrementally being built after each word in 
the sentence fragment ‘The doctor who called...’. 


attached as a Property to the doctor concept. In fact much more information for both 
models is stored; the figure is only meant to sketch the most superficial attributes of 
each model in order to give an overall appreciation of the types of structures built. 

By design, the system is constrained to only allow one working hypothesis of each 
kind (syntactic and semantic) as processing progresses. As a result, in certain situa¬ 
tions such as garden-path constructions the wrong hypothesis will be pursued; when 
the current hypothesis becomes untenable in the light of subsequent context, the 
system is able to perform limited repair by snipping off the erroneous portions of 
the syntactic model, and then re-building it correctly. 

There is a close syntax/semantics correspondence: syntax is prior, and the system's 
semantic representation is interpreted from it. The category mappings are straightfor¬ 
ward: nouns map to basic concepts like Thing, adjectives map to Property, and verbs 
map to Action, State, or Event. The interaction between syntax and semantics extends 
to repair: syntactic snips initiate semantic snips which repair the semantic model in 
light of repairs to the syntax model. Similarly, there are semantics-based motivations 
for syntactic snips: an untenable semantic model can force previously constructed 
syntax to be re-analyzed. 

All of the above areas of functionality are performed by operators 2 . As in many 
traditional artificial-intelligence-based approaches, NL-Soar uses operators as the 
instrument for effecting state changes. An operator can be thought as an atomic bun¬ 
dling of actions that reflect some cognitive operation. Comprehension in NL-Soar 
performs lexical access, parse-tree construction, and semantic network construction 
piecewise, with (roughly) one operator of each type per word processed. For example, 
one syntactic operator would be used to combine the determiner the and the noun 
doctor into an NP. Similarly, a semantic operator would attach the concept called as a 
Property of the Thing concept doctor. 
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Op. 

Operation 

Related Constraints 

Example 

o23 

Link a DET as [spec NP] 

spec-head agreement, order, 
unique assignment 

the + meeting 
“a meetings 

o47 

Link an NP object 
as [comp V'] 

subcategorization, order, 
unique assignment 

saw + the cat 

“sneezed + her 

o76 

Link an NP subject 
as [spec IP] 

subject-verb agreement, order, 
unique assignment 

dogs + bark 
“men + sings 

ol2 

Project a node of cat 

XtoXP 

V => V' => VP 

sneezed 

o27 

Link an NP object 
as [comp P'] 

subcategorization, order 

to + the store 
“away + the dog 

o39 

Link an actor to its action 
as the external argument 

semantic constraints: class, 
animacy, saturation 

Person+Action 
Thing+Action 

o42 

Empty operator: note word, 
record as unintegrable into 
current context 

no actions possible at this 
stage of processing 

the 


Table 1 . Sample operator types and the linguistic operations they perform, related 
constraints, and examples where they would and would not apply. 


Table 1 shows a closer look at the linguistic nature of specific operators. o23 (the 
names are mere arbitrary tokens created by the system as they are learned) is an opera¬ 
tor that builds the specifier node into a noun-phrase (NP) constituent. In doing so, it 
must satisfy all of the requisite constraints specified by the grammar of the language; for 
English, there must be number agreement, the specifier must precede the head noun, 
and the two pieces must not have been previously assigned to other constituents. This 
operator would thus fail to build a constituent for the word sequence *a meetings, but 
would succeed for the meeting. The operator is of sufficient generality as to construct 
countless other NP’s in a similar manner, as long as the linguistic constraints are satis¬ 
fied. Now consider o39, an operator that links together pieces of semantic structure. In 
a sentence like the chair sneezed, one possible meaning interpretation is that a person 
(e.g., a college department chairman, of semantic class Person) performs the act of 
sneezing (an Action) and a less plausible one involves a man-made artifact (a chair, of 
semantic class Thing). This operator tests for semantic class compatibility, in essence 
testing and enforcing the linguistic constraints often called semantic cooccurrence 
restrictions. For example, o39 rules out Thing as the agent of the Action sneezed, and 
instead licenses the semantic attachment with class Person for its agent. Other opera¬ 
tors illustrated in the table perform other types of syntactic and semantic structure¬ 
building operations. The empty operator (in this case o42) notes the presence of a word 
for which no work can be currently done to integrate it into current syntactic or seman¬ 
tic context, at least until more information is provided (e.g., when new words are input 
to the system and thus added to the current context). 
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Figure 2 . Schematic overview of generation. Operators map pieces of the semantic 
model at the left into a corresponding syntactic model, depicted at the right. 

3. nl-soar generation. Given the previous discussion of comprehension in NL- 
Soar, it will be seen that much of the same principles and methods apply to the 
NL-Soar generation system. So far, work has focused on generating the output utter¬ 
ance via a syntactic model generated from predefined semantic structure; pre-tactical 
text planning (i.e. involving high-level considerations such as rhetorical structure, tex¬ 
tual cohesion, argumentation strategies, etc.) has not yet been implemented but should 
be possible given the hierarchical, goal-oriented Soar framework. There are two condi¬ 
tions that initiate tactical generation. The first is that some discourse move, such as 
responding to a question, prompts the NL-Soar agent to communicate. The second 
requirement for generation to proceed is that a coherent, connected LCS must exist. 

The crux of the current generation process is thus the mapping of semantic struc¬ 
ture to a well-formed syntactic utterance model. Once the syntactic structure has 
been generated, further processing generates the final output word sequence by read¬ 
ing off the lexical items from the terminal nodes of the structure. This results in a 
word-sequence (i.e. a sentence) which the agent outputs. A schematic representation 
of the central part of the generation process, mapping from a semantic model to a 
syntactic model, is given in Figure 2. 

Generation of a sentence, then, proceeds as follows. Once the system has deter¬ 
mined that a sentence must be generated, it traverses the connected semantic model 
to identify each concept and relevant annotations (such as tense, aspect, and plural¬ 
ity). All pieces of the model that could potentially contribute lexical material to the 
eventual output sentence are then considered globally. Particular types of operators, 
called selection operators, choose and prioritize which pieces of the model to process 
in which order. English operators, for example, specify that determiners must be gen¬ 
erated before nouns, and verbs before their object. 
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Once a given piece of the semantic model is chosen to process, work can continue 
in one of two modes. Recognitional processing (i.e., work that has already been ‘chun¬ 
ked up’, or learned from previous processing) can be carried out via pre-existing gen¬ 
eration operators. Generally, one generation operator is used to realize each concept 
of the semantic model, though the correspondence need not always be strictly one- 
to-one. When appropriate recognitional operators do not exist a priori, more costly 
deliberate processing must be carried out to create them. This process is sketched in 
the next paragraph. 

Each generation operator can be thought of as a bundled set of actions that allows 
generation to proceed. Its constituent actions include the following: 

1. lexicalizing the concept: This stage selects a portion of the semantic model and 
computes one way of rendering abstract concept(s) into some lexical item(s). 

2. performing lexical access: The lexicon is then accessed in order to retrieve 
syn/sem information for the targeted word(s). Words are given categories 
and projected to form lexical-phrasal items that can then be incorporated 
into the syntax model. 

3. building syntactic operators: This step takes the projected word and calcu¬ 
lates all possible ways it can be grammatically incorporated into the ongoing 
syntactic model, keeping track of agreement features, syntactic constraints, 
constituent structure, and available working memory. Note that any of the 
sample operators listed in Table 1 for comprehension can also be used in this 
stage of production. This point will be pursued in the next section. 

The three deliberate steps outlined above (lexicalization, access, and syntax model con¬ 
struction) may be bundled into a recognitional generation operator. This allows each of 
the three stages to fire in association with the others, and in the sequence specified. 

Occasionally this three-stage generation operator construction process does not 
succeed: inappropriate lexicalizations may be chosen, corresponding lexical items 
may be missing from the lexicon, or incompatibility may arise with the ongoing 
syntax model. When this happens, generation can recover and re-initiate the process 
with other selected words. This process continues until a successful outcome is 
achieved. Thus generation employs a generate-and-test strategy, iterating until a viable 
generation operator is constructed. This testing is done in lower, temporary problem 
spaces that are discarded upon completion. 

The goal of the NL-Soar generation procedure is to learn operators that render 
concepts/features from the semantic model into a grammatically correct syntactic 
model. This approach resembles Dorr's LCS-based approach for machine transla¬ 
tion (MT) (Dorr 1993) and subsequent related work in information retrieval; both 
approaches traverse a conceptual structure to perform incremental surface-language 
syntactic utterance generation. The operator mechanism is what distinguishes the 
NL-Soar approach from most other approaches. The lexicalization phase which estab¬ 
lishes a correspondence between appropriate pieces of the conceptual model and lexi- 
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cal items was inspired by efforts in realizing pieces of interlingua representation in 
more conventional, engineering-style knowledge-based system generation compo¬ 
nents; the lexical realization of objects, of predicates, and of features such as nega¬ 
tion, tense, and mood follow principles used in the generation component of an 
interlingua-based MT system (Lonsdale et al. 1995). 

4. integrating comprehension and generation. The version of the generation 
system described here takes the baseline NL-Soar system as a point of departure. The 
earliest version of NL-Soar only performed comprehension; later NL-Soar work did 
include operator-based generation (Rubinoff & Lehman 1994b), but with minimal 
overlap with the comprehension component. In developing the current generation 
system, an effort was made to share functionality across these two modalities, thus 
reducing system complexity. The result of this integration is that many of the same 
constraints, data structures, and system resources are used for both comprehension 
and generation. In a very direct and formal way, then, generation is built upon com¬ 
prehension; the significance of the work reported in this paper is that, as far as this 
author is aware, no system has yet been developed that exhibits all three of these 
properties in an architecture that is capable of learning the basic stages of linguistic 
processing. This section summarizes some areas of shared functionality between the 
two linguistic processes of comprehension and production: 

• Comprehension and generation both use the same lexicon. Morphosyntactic 
and semantic information for each word is encoded in productions that can 
be used in both processes. Additionally, this allows both to use the same lexi¬ 
cal access operator. 

• Comprehension and generation both use the same syntactic operators. Con¬ 
struction of the syntactic operator, whether during comprehension or gen¬ 
eration, proceeds in an identical fashion. Since this integration permits the 
same type of operators to be used for both, the two processes can bootstrap 
one another. For example, the operators such as those listed in Table 1 can be 
made available for processing by both linguistic modalities (comprehension 
and generation). Thus syntax operators learned in the process of compre¬ 
hension can be directly used for generation, and vice-versa. This transfer 
of knowledge between processes opens up many possibilities for exploring 
facilitation and interference between these two language subtasks. This has 
important implications for investigation of the human linguistic faculty, 
both in terms of performance and in competence. 

• Comprehension and generation both use the same architecture. The decay- 
susceptible input buffer also stores words for output generation of a sen¬ 
tence. A data structure called the assigners/receivers set, which restricts the 
number of syntactic constituents that can be stored at a time and computes 
their compatibility for combination, is shared by both processes. This set 
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constrains how much linguistic structure can be processed, in cognitively 
plausible ways (Lewis, 1993). 

• Comprehension and generation both use the same learning mechanisms. Just 
as comprehension operators can be learned, so too can generation opera¬ 
tors. Furthermore, since operators are compositional (generation operators 
incorporate lexical access syntax operators), their structure can be built up 
in turn by incorporating previously learned operators. 

Though there are compelling parallels in the architecture and processing meth¬ 
ods in the system, this does not preclude the existence of interesting asymmetries. 
In fact, since some human processing asymmetries have been described in compre¬ 
hension and generation, we would expect this to be modeled by NL-Soar. Such sit¬ 
uations do arise. In comprehension one often encounters certain local, temporary 
syntactic ambiguities that are usually successfully overcome when further context is 
introduced. For example, upon encountering the input sentence fragment: I have... 
the agent is not sure whether to comprehend the verb as an auxiliary verb or as the 
semantic-content verb indicating possession. The next few words usually serve to dis¬ 
ambiguate these alternatives in comprehension. On the other hand, since generation 
is driven by the underlying semantic model, there is no question as to the ambiguity 
of this construction at the point where the second word is generated. In other words, 
the generation component is less susceptible to some processing effects that charac¬ 
terize comprehension. 

Since the two processes draw upon the same data structures and basic operator 
architecture, it is also possible to interleave them. Thus, even though massively parallel 
processing is not available in NL-Soar, a very fine level of swapping between operators 
of different task types produces the overall impression of the simultaneous global per¬ 
formance of different tasks. Thus several tasks, including language (e.g., comprehension 
and generation), can be managed‘simultaneously’, (e.g., driving a vehicle while speaking 
on a cell phone) and can be interrupted and swapped (Nelson et al., 1994). 

5. ongoing work. In this section we sketch ongoing efforts by the author to extend 
the basic architecture and capabilities of the system. Current work on generation 
includes system-specific performance aspects. For example, the generation goal stack 
is quite deep since comprehension is embedded within generation; this incurs con¬ 
siderable chunking (learning) overhead. Occasional chunks (bundles of learned 
behaviors) fire out of their canonical order, introducing recognitional state-change 
sequences that differ from deliberation. A third issue is that, with long and complex 
sentences, the deep goal stack and attendant numerous state copies sometimes result 
in memory exhaustion. 

NL-Soar comprehension was also upgraded for French processing by adding a 
French lexicon, French-specific syntactic-model constraints (e.g., permitting object 
clitics before verbs), and relevant semantic model features (such as gender). This rela¬ 
tively simple process yielded a French comprehension capability. By leveraging the 
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same basic operator-based architecture and the approach outlined above, the devel¬ 
opment of a French generator was trivial. To date, no other target languages have yet 
been developed, though to do so would be straightforward. 

Specific properties of generation operators are still being addressed. How much 
semantic material should trigger generation of a piece of sentence structure? How 
many generation operators should be available at any given time, and how can they be 
appropriately selected? How specific should the chunks which encapsulate stages of 
an operator's application be? Should the granularity of generation operators be com¬ 
parable to those in comprehension? 

System improvements are also being made in the area of extending lexical cover¬ 
age of the system. Currently, the lexicon is largely hand-coded and reflects several 
hundred entries only. In an effort to scale up the system, an interface has been devel¬ 
oped with WordNet (Miller 1993), which provides the system with a much more sys¬ 
tematic description of semantic and syntactic properties. 

6. conclusion and acknowledgements. The system described in this paper 
reflects a general, operator-based approach wherein generation directly implements 
the comprehension infrastructure; the result is a basic paradigm for integrated, fully 
compatible language processing using lexical access, syntactic and semantic compre¬ 
hension, and generation. Interactions across these modalities such as bootstrapped 
learning, task interleaving, and shared knowledge structures reflect performance 
capabilities possible via this integration. The system's core approach assures an inter¬ 
esting tension between increased processing potential and cognitively constrained 
performance. This operator-based generator, which also draws upon analysis opera¬ 
tors, is thus a comprehensive language modeling system that exhibits shared func¬ 
tionality and architecture across language modalities. 


1 Thanks are due to fellow NL-Soar researchers at CMU (particularly Jill Fain Lehman, Rick 
Lewis, Julie VanDyke, Nancy Green, Robert Rubinoff, Greg Nelson) and at BYU (Nate 
Blaylock and Anton Rytting), and to members of the larger Soar project. Any errors are 
the responsibility of this author. 

2 NL-Soar also integrates a dialogue module that manages discourse turns and their 
related expectations. This aspect of the system has been documented elsewhere (Green & 
Lehman, 1996) and will not be discussed here. 
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introduction. Spontaneous oral language production is always remarkable in its 
unpredictability. A simple picture given to ten different people to describe will 
undoubtedly generate an equal number of unique utterances comprising different 
syntactic structures, lexical items, and intonational contours. Within all this variabil¬ 
ity there must exist certain factors which influence speakers to structure their lan¬ 
guage as they do; otherwise, we would be in a constant state of indecision about how 
to encode our message. One way to delve into this broad topic is to look at one par¬ 
ticular variable syntactic construction (verb-particle combination) and one potential 
sequence-influencing factor (discourse information flow). 

1. construction under investigation. The variable syntactic structure I chose for 
this experiment is the English verb-particle construction indicated in the following 
examples by italics. 

(1) pull up the anchor, bang in the door, hold off the enemy 

Despite being morphologically distinct, the verb and the particle do form a singular 
concept, evidenced by the fact that many of the verb particle combinations have one- 
word near-synonyms, as in (2). 

(2) raise the anchor, demolish the door, repel the enemy 

The verb-particle construction may often be confused with the V + PP sequence since 
they both can exhibit the same linear sequence of constituents. However, the verb-par¬ 
ticle combination is structurally distinct in that the particle is not fixed in its position 
relative to the noun phrase (NP). Whereas in (1) the particle precedes the NP object, in 

(3) the particle follows the NP object with no great change in meaning or effect. 

(3) pull the anchor up, bang the door in, hold the enemy off 

Since both orderings of constituents can be used to describe the same scene or 
can generate the same interpretation, it seems obvious that there is more at work 
than purely semantics in the eventual determination of one order over the other. 
Of the numerous potential factors which may influence the ultimate sequence, this 
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experiment investigates the role of a particular discourse factor in the course of oral 
languages production. 

2. THE EXPERIMENT. 

2.1 motivation. In 1967, Halliday brought to our attention a tendency for ‘given 
information (information already known in the discourse context) to precede ‘new’ 
information (information not previously mentioned or alluded to) in the linear pre¬ 
sentation of a sentence. 

If one applies this logic to the sequencing of constituents in the verb-particle con¬ 
struction, it may be that the principle of given vs. new information will influence 
the distribution of the particle in relation to the object NP. More specifically, if the 
entity encoded by a particular object NP has not been previously mentioned in 
the discourse (and would therefore be considered ‘new’), would it surface after the 
particle more often than in cases where the entity has already been mentioned, in 
which case, (being ‘given) it would precede the particle? 

2.2 METHODOLOGY. 

2.2.1. participants. 56 native speakers of English participated in this experiment. 
Most were undergraduate students from the University of Alberta, between the ages 
of 20 to 30, and who had grown up in western Canada. A handful of participants, 
however, were older or from a different English speaking region of the world. Many 
individual responses from each participant did not follow the target verb-particle 
form so could not be used in the analysis. 

2.2.2. stimuli. Participants were presented with simple black and white line drawings 
of basic agentive events. These pictures were designed to elicit particular target struc¬ 
tures in participant responses. The target responses included eight different particles 
(i.e., away, back, down, in, off, on, out, up) and nine different verbs (i.e., pull,put, hold, 
cut, fill, shut, turn, cross, blow). Since the meaning of the verb ‘puli’ does not explicitly 
denote directionality, it was used in combination with all of the stimulus particles. 
See Appendix A for a complete list of target verb-particle combinations, along with 
the expected NP objects. This set of particles was chosen because they are all produc¬ 
tive, frequent, and convey spatial relations that can be depicted relatively easily. Simi¬ 
larly, these particular verbs were chosen because they are fairly frequent and easy to 
portray graphically. These qualities were important to consider so as to prevent delays 
that could potentially occur if participants were trying to retrieve an infrequent or 
unclearly represented word from their lexicon. 

Participants were prompted to describe each scene by the textual presentation of the 
subject NP and the verb at the bottom of the screen. In all target trials, the NP subject 
written on the screen coincided with the agent of the action so they were all active sen¬ 
tences. All of the agents in the pictures were non-specific people (e.g., boy, woman, fish- 
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erman, etc.). All of the patients or themes (thus the objects) were common non-human 
entities whose linguistic labels are morphologically simple and fairly frequent. 

In order to prevent participants from thinking about the types of syntactic struc¬ 
tures they were producing during the experiment (especially to avoid conscious 
processing of verb, particle, and noun phrase ordering), I included numerous filler 
stimulus trials in each section of the experiment. The filler stimulus trials included 
four scenarios which facilitated dative responses, four scenarios which facilitated pas¬ 
sive voice responses, and eight scenarios which facilitated basic transitive sentence 
responses. (For a complete list of the anticipated filler responses, see Appendix B). 
Thus, the total number of stimulus item sets was 16 target sets and 16 filler sets. 

2.2.3. procedure. The basic task involved was a picture-driven sentence-completion 
task. Participants sat in front of a desktop computer and controlled the presentation 
of stimuli with the space bar of the keyboard. In the first half of the experiment, at the 
start of each trial, the participants would see the simple black and white line draw¬ 
ing of the event. Two seconds after presentation of the picture, the start of a sentence 
(NP subject and verb) would appear at the bottom of the screen. Participants were 
to read aloud the start of the sentence and keep talking to provide a completion that 
described the picture. Responses were tape recorded and later transcribed. Once they 
had finished the utterance, participants pressed the space bar to start the next trial. 

The second half of the experiment was slightly different in that some additional 
information was provided to the participants before each trial. Prior to presentation 
of each picture, two clauses would appear on the screen. Again, the participants would 
read those clauses out loud, after which they would press the space bar to prompt 
display of the picture. Two seconds later, the start of a sentence would appear at the 
bottom of the screen and participants would complete each sentence in the same 
manner as section one. 

The two clauses uttered before each picture-presentation acted to provide the par¬ 
ticipant with some background context for each picture. In half of the cases, the back¬ 
ground context explicitly mentioned the entity that would, hopefully, function as the 
NP object in the utterance. This prior mention of the object would render it ‘given. 
In the remaining half of the trials in this section, the background context would not 
explicitly mention the NP object thus leaving the entity as new information to be 
introduced in the picture. 

To recap, there were three different conditions in the experiment: a no-context 
condition where no prior linguistic context was given, a new-context condition where 
the target object was not mentioned in the prior linguistic context, and a given-con¬ 
text condition where the target object was mentioned in the prior linguistic context. 

In order to generate data on each picture displayed in each of these three contexts, 
three separate groups of participants were required so that no one person would see 
the same picture more than once. The two background contexts were controlled to 
be, syntactically, as similar as possible for each picture, to prevent some other factor 
in the background contexts from coming into play. For example, the new-context and 
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given-context clauses for a picture which portrayed a woman blowing up a balloon 
are given below. 

(4) a. New-Context 

Clause One: The woman decided she wanted decorations for the party 

Clause Two: Before all of the guest arrived... 

Sentence Start: the woman blew... 
b. Given-Context 

Clause One: The woman decided she wanted balloons for the party 

Clause Two: Before all of the guest arrived... 

Sentence Start: the woman blew... 

Notice how clause number one is very similar in both contexts and how clause 
number two is exactly the same in both contexts. The reason for including the inter¬ 
mediary clause was to introduce a new participant to the discourse so that the speaker 
would less likely use a pronominal object in the target utterance. Since pronominal 
objects are, for the most part, restricted to surfacing before the particle, responses 
including a pronoun would not be revealing or included. 

2.3. predictions. The no-context condition was included to generate some kind of 
base-line measure from which to view the results of the contextualized conditions. If 
no other factors exist which may influence the ordering of the particle and object NP 
then we may expect a random patterning in the no-context condition, where about 
half of the responses have the particle preceding the object and the other half follow¬ 
ing the object. However, because there are many other considerations being processed 
by the speaker, there may be one ordering which emerges as the preferred one in iso¬ 
lated utterances. 

The orderings of the contextualized conditions are easier to predict because the 
target stimuli are being treated in this experiment so as to influence one particular 
outcome. If the given vs. new principle is at work in the surface realizations of the 
particle and object, then one would predict that the object will surface in front of 
the particle when it is considered given information more often than in cases where 
it is considered to be new information. 

2.4. results. When considering all the results together, an overall tendency emerged 
for the verb and particle to be adjacent (with the NP object following). 61% of all 
utterances (n = 605) followed this order. So, it appears that this principle of ‘verb- 
particle adjacency’ is an underlying force that other factors need to overcome in order 
to have any sequencing effect. 

The predictions above indicate that verb-particle adjacency would be less frequent 
in contexts where the NP object is given than in contexts where the NP object is new. 
This predication is exactly what was realized in this experiment. In the given-context, 
verb-particle adjacency applied in 54% of the utterances (n = 185), whereas in the 
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No Context 

Given 

New 


Figure 1 . Response orders across all contexts. 




Figure 2. NP = 2 words responses over all contexts. 


new-context, verb-particle adjacency applied in 69% of the utterances (n = 212). The 
no-context condition fell in the middle with verb-particle adjacency adhering in 59% 
of the utterances (n = 208). Figure 1 demonstrates these results in the form of a graph. 
Notice how the bar representing the new contexts and the bar representing the given 
contexts intersect at fairly large angles indicating a strong interaction. A chi-square 
analysis reveals that this interaction is indeed significant (y 2 = 9.819, p = 0.002). 

An independent syntactic factor which influences ordering is the length of the NP 
object. As discussed in Fraser (1975) and demonstrated by Hunter (1981), the more 
words in the NP object, the more likely this object will be realized at the end of the 
verb phrase after the particle. Since I did not want this variable to artificially inflate 
the frequency of verb-particle adjacency, (especially in the predicted new-context), 
I also analyzed the set of responses that only included NP objects which were two 
words in length (Det + N). Again, the results agreed with the predictions with verb- 
adjacency adhering most in the new-context (65%, n = 171), least in the given-context 
(53%, n = 169), with the no-context responses falling in the middle (56%, n = 185). 
Figure 2 demonstrates these results graphically with the same basic interactions as are 
found in Figure 1. As before, the interaction between the new-context and the given- 
context results are significant (y 2 = 5.270, p = 0.02). 

Since the word pull had been paired up with all of the different particles, an analy¬ 
sis of the pull results may reveal if the specific verb has any affect on the verb-particle 
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adjacency and given vs. new distribution. The results show that the pull cases behaved 
in very much the same way, with verb-particle adjacency applying in 50% of the 
given-context cases (n = 90) and 65% of the new-context cases (n = 97), another sig¬ 
nificant result (y 2 = 4.276, p = 0.036). 

Although the target particles were defined as that set of eight, in an oral language 
elicitation task, the range of responses is always greater than anticipated. For this reason, 
the results included five additional particles (i.e., open, apart, aside, along, aboard). Along 
with the anticipated particle away, the total number of disyllabic types of particles used 
was six, with 58 tokens of those types being articulated. An analysis of the disyllabic 
particles in comparison with the monosyllabic particles revealed an interesting trend. 
Whereas 65% of the monosyllabic particles (n = 547) were realized adjacent to the verb, 
only 27% of the disyllabic particles (n = 58) were realized adjacent to the verb. Almost 
three quarters of the disyllabic particles were uttered after the NP object. A chi-square 
analysis demonstrates the certainty of this outcome (y 2 = 33.641, p < 0.005). 

One last area of analysis to consider is the behaviour of individual verb-particle 
combinations. For the most part, individual pairings did not have enough individual 
tokens to produce a statistically revealing analysis. However, some pairings did seem 
to show certain patterns of behaviour. The combination cut down could have been 
the poster boy for the given vs. new effect camp with verb-particle adjacency only 
holding in 27% of all the given-context utterances (n = 15) but in 82% of all the new- 
context utterances (n = 17). On the other end of the spectrum would be the combina¬ 
tion blow up which seemed to defy influence of given vs. new information with a high 
81% of given-context utterances (n = 16) showing verb-particle adjacency. Although 
this number seems very high, it is still quite a bit lower than the new-context cases 
where 100% of utterances demonstrated verb-particle adjacency. 

One last combination worthy of mention is the hold back pairing. Only two out of 
seventeen people uttered the NP object after the particle as would be the predicted 
order according to the verb-particle adjacency pattern. Whereas only two speakers 
said ‘hold back the dog’, fifteen uttered something like ‘hold the dog back’. One pos¬ 
sible explanation for this seemingly contradictory result is that back is not really a 
particle in this case but is functioning more like a place adverb indicating the loca¬ 
tion where the event is taking place (i.e., back away from something else). If back was 
an adverb in these cases it would be an adjunct and would, as such, be realized after 
the NP object complement which is adjacent to the verb. For this explanation to hold 
water, it must also account for the other back data collected for the pull back pairing. 
Although the number of legitimate pull back responses is quite low, the results seem 
to contradict the hypothesis that back is an adverb and not a particle. Over half of the 
total given-context responses (n = 7) have back surfacing in between the verb and NP 
object and almost all of the total new-context responses (n = 6) have the NP object 
at the end. Therefore pull back and hold back seem to be behaving quite differently. 
Arguing that back is an adverb in one case and a particle in the other does not seem, 
to me, to be the best approach. Perhaps, instead, hold back is an idiosyncratic pattern 
where the sequencing of an intervening NP is becoming fixed and resistant to any 
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outside ordering pressures. Blow up could also be a case for certain sequences becom¬ 
ing fixed, although this pairing appears to resist separation by an NP object. 

3. conclusion. Verb-particle constructions are interesting to the researcher of lan¬ 
guage production because of the variable ordering of the particle in relation to the NP 
object. This particular study focused exclusively on oral language production and the 
interaction of discourse context and the placement of the particle. The most pervasive 
result was the emergence of the verb-particle adjacency pattern whereby speakers 
showed a marked preference for the production of the verb and particle next to each 
other, with the NP object surfacing at the end. Because the verb and particle form a 
conceptual unit, it is understandable that this conceptual unity would translate into 
structural unity via syntactic adjacency. However, obviously this structural unity does 
not hold all of the time or else the verb-particle construction would not be consid¬ 
ered ‘variable syntax. Thus, there must exist certain factors which influence alternate 
orderings or strengthen the frequency of occurrence of the unmarked order. One 
such factor is the given vs. new property which predicts that if the NP is already given 
in the discourse context, it will be realized more often in front of the particle than if 
that NP has not already been mentioned in the conversation. This influence of infor¬ 
mation flow in a sentence was shown to be at work in many different sets of utter¬ 
ances recorded during the experiment. One set included all of the responses whereas 
another was controlled for NP object length. The pattern emerged in both of those 
sets just as when only the pull verb combinations were considered. Certain individ¬ 
ual verb-particle combinations seemed to display idiosyncratic behaviour whereby 
they were either very subject to influence by the verb-particle adjacency preference 
or were quite resistant to this influence. The given vs. new principle obviously plays a 
small role in the determination of constituent ordering in the verb-particle combina¬ 
tion. The multitude of additional factors at work must also be tested and assessed for 
a better understanding of syntactic production. 


APPENDIX A: STIMULI VERBS, PARTICLES, AND 
ANTICIPATED NOUN PHRASE OBJECTS 


PARTICLE 

VERB 1 

(freq.) 

D.O. 

VERB 2 

(freq.) 

D.O. 

AWAY 

pull 

145 

the wagon 

put 

513 

the toys 

BACK 

pull 

145 

the curtain 

hold 

509 

the dog 

DOWN 

pull 

145 

the screen 

cut 

245 

the tree 

IN 

pull 

145 

the net 

fill 

184 

the hole 

OFF 

pull 

145 

the sticker 

shut 

5° 

the light 

ON 

pull 

145 

the shirt 

turn 

566 

the radio 

OUT 

pull 

145 

the tooth 

cross 

84 

the number 

UP 

pull 

145 

the anchor 

blow 

52 

the balloon 
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APPENDIX B: FILLER SENTENCES (COMPLETIONS PREDICTED) 


Transitive 

Dative 

1 . 

The boy kicked the ball over the pylon 

9- 

The woman read the book to the chil¬ 

2 . 

The boy threw the ball into the net 


dren 

3- 

The woman sat in front of the TV 

10 . 

The girl mailed a letter to Santa Claus 

4- 

The boy ate the carrot at 3:00 pm 

11 . 

The man offered some cookies to the 

5- 

The woman drove the car into the car 


woman 


wash 

12 . 

The girl gave a present to her friend 

6 . 

The girl flew a kite under the power 




lines 

Passive 

7- 

The woman closed the door to Dr. 

13- 

The flower was being watered by the 


Noams office 


woman 

8 . 

The boy built a sand castle under the 

14- 

The walkman was being listened to by 


umbrella 


the boy 



15- 

The paintbrush was being used by the 




woman 



16 . 

The letter was delivered by the postman 
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DOES SYNCHRONY RECAPITULATE DIACHRONY? 
A CASE STUDY FROM JAPANESE 


Gary D. Prideaux 
University of Alberta 


to what constraints 1 should a synchronic linguistic description (an analytic 
description in Lamb’s (1999) terms) be subject? Should it be one which, following 
Chomsky (1965), captures the native speaker intuitions? Should it be the most parsi¬ 
monious (ignoring for the moment the question of how this is measured)? Should 
it be one which captures linguistically significant generalizations? Within generative 
methodology, it is assumed that these three conditions are all congruent, and a par¬ 
ticular description is the ‘correct’ one if it satisfies all three. 

In the earliest proposals offered up by the generative grammarians, it was claimed, 
albeit implicitly, that the underlying forms of contemporaneous lexical items might 
lie in their historical antecedents. Could, for example, mice be analyzed synchronic- 
ally in terms of its historical development as a sequence of the Old English form 
in which /mus/ and its plural /—i/, coupled with phonological rules to account for 
(a) vowel harmony (/miisi/), (b) open syllable lengthening (/mii:si/ (c) loss of the final 
vowel /mii:s/, (d) the front vowel unrounding, yielding /mi:s/, and finally (e) the Great 
Vowel Shift altered /i:/ to /ai/, yielding the modern form /mais/ ‘mice’. In such cases, 
it would appear that the history of the language can ‘explain’ irregular modern forms. 
While the Sound Pattern of English (Chomsky & Halle 1968) was an advocate of this 
claim, it was seriously challenged within many studies. Are, for example, milk and lac¬ 
tose and galaxy all related synchronically as well as diachronically? The issue seems in 
retrospect to be at best amusing and at worst ludicrous. 

However, it is precipitous to dismiss out of hand the possibility that historical 
changes might be directly reflected synchronically within complex morphosyntactic 
structures. As Hale (1973) pointed out long ago, cases are attested in which sound 
changes can force the restructuring of entire paradigms. These two possibilities, his¬ 
torical recapitulation and restructuring, taken as extreme poles, allow in principle for 
competing linguistic descriptions of a particular phenomenon, one of which reca¬ 
pitulates the history of the language while the other reflects a restructuring. Which 
analysis would be theoretically preferred? Presumably the one which better reflects 
the native speaker’s intuitions, even if that description is less parsimonious. 

It is the purpose of this paper to address this problem through an examination of 
the Japanese adjectival paradigm. Three alternative descriptions of the paradigm are 
presented, one which is quite transparent, cleaves to the surface and reflects a histori¬ 
cal restructuring of the paradigm, a second which is quite abstract but which recapitu¬ 
lates the history of the language, and a third taken from traditional Japanese school 
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Gloss 

Present 

Past 

Provisional 

Conditional 

eat 

taberu 

tabeta 

tabereba 

tabetara 

wait 

matu 

matta 

mateba 

mattara 

write 

kaku 

kaita 

kakeba 

kaitara 

be (inanimate) 

aru 

atta 

areba 

attara 


Table 1. Some verb inflections. 


grammar. Each analysis makes its own predictions as to how native speakers of Japa¬ 
nese process adjectives. A simple experiment was conducted to address this issue, and 
the three analyses are evaluated against the results of those native speaker judgments. 

In what follows, we will first review the Japanese paradigms, and then offer the 
three analyses. We then turn to the experiment and show that a highly elegant analy¬ 
sis, one which purports to capture linguistically significant generalizations about the 
language, actually fails to characterize native speaker intuition. 

1. Japanese verbs and adjectives. In Japanese, adjectives, like verbs, are inflected 
for tense as well as for several other categories. Japanese is an agglutinating lan¬ 
guage, rich in sequences of suffixes but relatively impoverished in prefixes. In order to 
develop the alternative analyses of the adjective paradigm, it is helpful to have before 
us the verbal paradigm as well. Table 1 provides a selection of verb forms, inflected for 
a few of the many possible categories. 2 

Japanese verbs are typically partitioned into two classes, those with vowel-final 
stems (/tabe-/ ‘eat’) and those with consonant-final stems (e.g., /mat-/ ‘waif, /kak-/ 
‘write’, /ar-/ ‘be’). The present tense suffix has two allomorphs, /-ru/ after a vowel and 
/-u/ after a consonant. Various fairly obvious phonological rules are also involved, 
including two very general and important ones: 

(1) k, g ->• 0 / ] i (lenition of velars at a morpheme boundary (]) when /i/ 

follows) 

(2) r -> t / ] t (assimilation of /r/ to /t/ at a morpheme boundary) 

In addition, it is clear that the two allomorphs of the present tense morpheme 
(/-ru/ and /-u/) can be reduced to one, /-ru/, by the following consonant-cluster 
reduction rule: 

(3) r ->• 0 / C ]_ (reduction of non-geminate consonant cluster 

at a morpheme boundary) 

This rule expresses the general phonological constraint that the only types of per¬ 
mitted consonant clusters are geminates and nasal + consonant. Thus, the verb para¬ 
digm can be expressed in part 3 as in Table 2. 
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Verbal Inflectional Suffixes 
Present Past Provisional Conditional 

-ru -ta -reba -tara 

Table 2. Some verb inflectional suffixes. 


Gloss 

Present 

Past 

Adverbial 

Provisional 

Conditional 

tall 

cold 

large 

takai 

samui 

ookii 

takakatta 

samukatta 

ookikatta 

takaku 

samuku 

ookiku 

takakereba 

samukereba 

ookikereba 

takakattara 

samukattara 

ookikattara 


Table 3. Some adjective inflections 

It is, however, the adjectival paradigm which is our present concern. An example 
of the ways in which the adjective can be inflected is found in Table 3. 

The questions at the heart of this paper are: What is the best way to treat the adjec¬ 
tival paradigm? Should adjectives and verbs be treated separately, each with its own 
distinct set of inflections, or should they be treated within a single system of inflec¬ 
tions? The empirical issue is: How do native speakers of Japanese conceive of this 
paradigm? It is to a comparison of different analyses of the adjective paradigm that 
we now turn. 

2. competing analyses. Many descriptive linguists who have worked on Japanese, 
such as Jordan (1963), treat verbs and adjectives as two distinct lexical categories, each 
with its own inflectional paradigm, while others have tried to collapse the two into a 
single category. The following three analyses are broadly representative of the families 
of analyses which have been proposed. The first maintains the distinction between 
verb and adjective inflections, while the second does not. The third is a traditional 
Japanese grammatical analysis. 

2.1. concrete analysis. The most common description offered for the two par¬ 
adigms is one which is both transparent and relatively simple. It treats verbs 
and adjectives as separate lexical classes with the stipulation that both are 
marked for virtually the same set of categories, though with different allomorphs 
for each. Within this analysis, adjective stems are all vowel-final (e.g., /taka-/ 
‘tall’), and the present tense marker is /—i/, while the past tense marker is 
/-katta/. According to this analysis, the adjective should be parsed as in (4). 

(4) ADJstem + suffix (e.g., taka+katta > /takakatta/) 

This highly transparent analysis is summarized in Table 4. An important feature of 
this analysis is that each suffix is monomorphemic, each with its own unique semantic 
content. 
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Present 

Adjectival Inflectional Suffixes 
Past Adverbial Provisional 

Conditional 

-1 

-katta -ku -kereba 

-kattara 


Table 4. Concrete adjective analysis. 


Present 

Adjectival Inflectional Suffixes 

Past Adverbial Provisional Conditional 

-0 

-ar-ta -u -ar-eba (=ereba) -ar-tara 


Table 5. Abstract adjective analysis. 

This analysis makes the empirical prediction that native speakers will treat the 
stem as vowel-final and constant throughout the paradigm, and that each of the suf¬ 
fixes is monomorphemic. 

2.2. abstract analysis. Since verbs and adjectives participate in almost identical par¬ 
adigms, since they function identically in main and subordinate clauses, and since 
the only other inflected category in the language is the copula, it is tempting to 
formulate a single unifying analysis encompassing both verbs and adjectives. Such 
an analysis has been proposed by, among others, Makino (1968). This typical genera¬ 
tive analysis is both highly parsimonious, since it treats verbs and adjectives in a uni¬ 
tary manner, and also to a large extent it recapitulates the historical development of the 
adjective paradigm (Chamberlain 1907, Henderson 1945, Martin 1987, Miller 1968). 

The essence of this analysis is that adjective inflections are derived from a sequence 
of an adjective stem and an inflected form of the existential verb /aru/. Such an analy¬ 
sis clearly mirrors the historical development of the paradigm, as Henderson (1945), 
Martin (1987), and others have shown. However, this analysis requires that the adjec¬ 
tive have a stem-final Ik/, which is common throughout the paradigm except in the 
present tense. As Chamberlain (1907) and Miller (1968) both observe, the Ik/ is an 
adjective class marker in Old Japanese. According to this analysis, then, any adjective 
should, in all cases but the present tense, be parsed as in (5). 

(5) ADJstem + ar + suffix (e.g., takak+ar+ta > /takakatta/) 

According to this more abstract but historically transparent analysis, the inflectional 
suffixes for the adjectives in table 3 should be treated morphologically as in Table 5. 

But how is the most frequent and common form, the present tense of the adjective, 
to be handled? Makino’s (1968) solution is that the present tense, the most unmarked 
member of the paradigm, has no suffix at all. Thus, the present tense form is simply 
the stem, e.g., /takak/). But a completely general phonological constraint of Japanese 
is that a word may not end in a non-nasal consonant. The phonological resolution of 
this problem is the addition of an epenthetic vowel, typically /-i/, after the final (non¬ 
nasal) consonant, as represented in rule (6). 
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(6) 0 —► i / C # (the epenthetic vowel /i/ is added to a word-final conso¬ 

nant, where # is a word boundary) 

Thus, the present tense (unmarked) form for an adjective would simply be: 

(7) ADJstem # (e.g., /takak#/) 

And to this stem would be added the epenthetic vowel /i/ (by rule (6), yielding: 

(8) ADJstem - i # (e.g., / takaki#/) 

We recall the general rule of velar lenition, the deletion of a Ik/ or /g/ at a morpheme 
boundary before a high front vowel (rule (1)). The derivation of a present tense form 
of an adjective like /takai/ ‘tall) would then be : 

(9) takak# > takaki# (rule 6) > /takai/ (rule 1) 

Thus, this analysis, while quite abstract, nevertheless recapitulates the historical 
development of the adjective paradigm. Moreover, it is parsimonious to the extent 
that it unites adjectives and verbs into a single analysis 

This analysis, like the first, makes an empirical prediction. It predicts that native 
speakers of Japanese recognize that an inflected adjective consists of a stem (ending 
in /-k/) plus an inflected form of the verb /aru/, with virtually all the suffixes being 
polymorphemic. 

2.3. traditional analysis. A third analysis should also be explored here, namely 
that of the traditional Japanese school grammarians (see Chamberlain 1907, Ikeda 
1980). This analysis is highly detailed and treats verbs and adjectives as inflected 
forms, with basically the same categories for each, in spite of the fact that the suffixes 
are quite different. However, this traditional analysis treats every member of the 
adjectival paradigm as having its own unique stem and unique suffix. One reason 
for this complexity derives from the Japanese writing system which is composed of 
kanji, borrowed and adapted Chinese characters (used primarily for the meaning¬ 
bearing roots of nouns, verbs, and adjectives) and syllabaries (used for inflections, 
function words, and the like). The syllabary used for inflections (hiragana), like the 
other ( katakana ) system, does not permit the writing of a single consonant other than 
a nasal or a geminate cluster. It is impossible to have within the traditional Japanese 
writing system a break between all phonemic segments which an alphabetic system 
allows. Since only syllable (or mora) boundaries are expressible, it would not be pos¬ 
sible to write a form such as /takaku/ as takak-u but only as taka-ku. 

The traditional Japanese analysis of the adjective forms is primarily taxonomic in 
nature and is exemplified in part in Table 6, for the adjective /takai/ (after Henderson 
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Adjectival Inflectional Suffixes 


Stem 

Suffix 

Gloss 

Traditional Japanese label 

taka 

-i 

present tense 

rentaikei 

shuushikei 

‘prenominal form’ and 
‘terminal form’ 

takakat 

-ta 

past tense 

renyookei 

‘adverbial form 

takaku 

-naru 

adverbial 

renyookei 

‘adverbial form 

takakaro 

-0 

tentative 

mizenkei 

‘incomplete form’ 

takakere 

-ba 

provisional 

izenkei 

conditional form’ 


Table 6. Traditional Japanese adjective analysis. 


1945). As Table 6 reveals, each member of the paradigm has its own unique stem and 
its own suffix. 

The traditional analysis also makes empirical predictions as to how native speakers 
will parse their adjectives, namely that there will be different stems for each member 
of the paradigm, plus distinct suffixes, each uniquely keyed to its stem allomorph. 
This is highly complex, ad hoc, and presents a quite awkward and counter-intuitive 
analysis. It does not reflect any overall generalizations nor does it respect obvious 
candidates for morpheme boundaries. 

In order to test these three quite distinct analyses of the Japanese adjectival para¬ 
digm, a simple experiment was carried out. In it, native speakers of Japanese were 
asked to parse Japanese verbs and adjectives into as many components as they con¬ 
sidered relevant. It is to a discussion of this experiment that we now turn. 

3. the experiment. The 26 volunteer participants in the experiment were 17 females 
and 9 males, ranging in age from 17 to 59, though the vast majority were in the 30-50 
age range. All had received their pre-university schooling in Japan. Both the instruc¬ 
tions and the stimulus materials were provided in Japanese. Since, as we saw above, 
the Japanese writing system uses both kanji and two distinct syllabaries, it is not 
possible to separate all phonological segments within a syllable. Thus, the stimulus 
materials were presented in both the native orthography and in a romanized version 
(romaji ). The stimulus materials contained a set of randomly ordered conjugated 
verbs and adjectives. In all, 22 different adjectives were used, and while not all of the 
many forms of any adjective or verb were included, those cited in Tables 4 and 5 
above were all included. Each of the five paradigm members represented in Table 2 
was represented with eight tokens, yielding a total of 40 adjectives in the stimulus 
set. Distracters were also included. Participants were instructed about morphemes as 
meaning units and were given examples (taken from nominal, not verbal or adjecti¬ 
val, constructions). They were then asked simply to divide each stimulus item into as 
many meaningful units as they thought were present. The task was not onerous and 
took only about 20 minutes to complete. 
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Present 


Suffix 

Past 


Adverbial 

0 

i other 

katta 

kat-ta atta 

ka-tta 

ku k-u 0 

139 

67 2 

157 

26 8 

17 

172 8 28 


Table 7. Frequency of parses for present, past, and adverbial forms. 



Suffix 

Provisional 

Conditional 

kereba kere-ba ke-reba ereba reba 

kattara katta-ra kat-tara attara kat-t-ara ka-ttara 

117 71 2 8 10 

70 57 41 8 8 24 


Table 8. Frequency of parses for provisional and conditional forms. 


4. results and discussion. The patterns found in the data were not random, but 
rather interesting and informative patterns emerged. The tabulated results for the pres¬ 
ent, past, and adverbial data are found in Table 7. These forms are highly frequent in 
the language. The provisional and conditional data, far less frequent both in conversa¬ 
tional and written language, are found in Table 8. We examine each set separately. 

For the present tense, by far the majority of participants treated a simple adjective 
such as /takai/ as having no suffix at all, in spite of the fact that the word is written in 
Japanese as a single kanji for the root /taka-/, followed by the syllabic HI. While there 
is not complete unanimity amongst the participants, there is a strong statistical ten¬ 
dency for the present tense to be viewed as monomorphemic (\ 2 = 135.5 ,p < .00001) 
in contrast to both traditional and the concrete analyses, both of which treat /—i/ 
as a tense marker suffix. It should be noted that for the present tense, however, the 
‘runner-up’ is /-i/. Thus, for the present tense, the abstract analysis wins. 

The past tense form also shows considerable variation, but the dominant pref¬ 
erence across the participants is for /-katta/, a single morpheme marking the past 
tense (y 2 = 285.8, p < .00001), with a distant second the form /kat-ta/, which is pre¬ 
dicted by none of the analyses, though the abstract analysis does predict a morpheme 
boundary within this form, so this provides some support for the abstract analysis as 
the runner-up. Finally, the adverbial preference is for /-ku/ as the suffix (y 2 = 230.9, 
p < .00001), though a goodly number of participants chose to view this as monomor¬ 
phemic. Here the concrete analysis wins handily, with the abstract and traditional 
analyses representing the runners-up. 

For the first three, highly frequent, forms of the adjective conjugation, it appears 
that none of the three analyses is uniformly supported. In particular, the more con¬ 
crete analysis is supported for the past and adverbial forms, while the more abstract 
analysis seems supported for the present tense. Or is it? We return to this below, but 
first let us examine the other two forms of the adjective which were investigated here. 

For the less frequent forms, those found in table 8, we find much more scatter 
and variation, as might be expected. For the provisional member, the clear winner 
(y 2 = 246.3, p < .00001) is the monomorphemic suffix /-kereba/ (supporting the 
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Paradigm 

Preferred 

form 

Runner(s)-up 

Predicted by category 
analysis 

present 

0 

-i 

abstract 4 

concrete (and traditional) 

past 

-katta 

-kat-ta 

concrete 

abstract 

adverbial 

-ku 

0 

concrete 

abstract (and traditional) 

provisional 

-kereba 

-kere-ba 

concrete 

abstract (and traditional) 

conditional 

-kattara 

-katta-ra 

-kat-tara 

concrete 

abstract 

abstract (and traditional) 


Table 9. Overall results for the adjective paradigm 


concrete analysis), while the runner-up is the bi-morphemic /-kere-ba/, predicted in 
part by both the abstract and traditional analyses, since both predict an internal mor¬ 
pheme boundary. The conditional form shows the most variability in the distribution 
of its forms, but the apparent winner is /-kattara/, predicted again by the concrete 
analysis. There is a statistically significant difference among the three top contenders 
(y 2 =24.8 ,p < . 0001), although not between the two runners-up, /katta-ra/ and /-kat¬ 
tara/ (y 2 = 2.61 ,p = 0.25.). However, both are bi-morphemic, which could give some 
limited support to the abstract analysis. 

These overall results are summarized in Table 9. As the table reveals quite clearly, 
the overall preferred analysis, the one which the participants in this study seemed to 
follow, is the more concrete analysis, with the abstract and traditional analyses play¬ 
ing some role in the ‘runners-up’ results. What are we to make of this? 

The fact that there is scatter in the data, especially among the relatively less fre¬ 
quent forms, suggests that a profound restructuring of the paradigm is well underway. 
The restructuring appears to have been pretty much complete in the more frequent 
forms (Table 7). Both the past and adverbial forms clearly represent a restructuring, in 
which the old stem-final Ik/ has been reassigned to the suffix, and the internal struc¬ 
ture of the past tense morphology has been erased, leaving simple monomorphemic 
suffixes. Restructuring is less evident for the infrequent forms, but the tendency is 
clearly underway, having advanced further in the provisional than in the conditional 
forms. A guiding principle seems to be that the entire system is gravitating to mono- 
morphemic representations. That is, the more abstract analysis, which recapitulates 
the history of the language, is being replaced by a more transparent, concrete analysis 
for contemporary native speakers. 

While it might appear that the present tense form is an exception to this general¬ 
ization, I do not think that is in fact the case. Being the most frequent form of the 
paradigm, but with the stem-final IkJ absent in it, though present in all the other 
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forms, the present tense seems the most fossilized in its monomorphemic status, a 
view reinforced by the traditional grammatical analysis. Thus, it may well be the most 
advanced member of the paradigm in the restructuring process. 

The less frequent forms, the provisionals and conditionals, appear to be the 
most unsettled members on the road to restructuring. The internal, historically 
valid, morpheme boundaries are giving way under pressure for monomorphemic 
status throughout the paradigm, but the tendency, especially within the condition¬ 
als, appears to be influenced strongly by issues of analogy. The conditional, after all, 
does share with the past tense form the sequence /katta/, so it is not surprising that 
the native speaker, especially in a task like the present one, would see that parallel 
and exploit it. Nevertheless, the monomorphemic form does win out, and the move 
toward monomorphemic adjectival suffixes seems well on the way to completion. 

5. conclusions. Three major conclusions can be drawn from this study. The first is 
that no matter how elegant and parsimonious a particular linguistic analysis might be, 
it does not necessarily explain the behavior of native speakers. If it means anything at 
all, ‘linguistic competence’ must tell us about how actual native speakers really do use 
their language and it must reveal empirically what they know and what they do. The 
abstract analysis, while elegant and appealing for its simplicity, fails in that task. 

Secondly, restructuring is taking place, and a guiding force seems to be semantic 
transparency at the surface. There is no reason for a native speaker to retain old mor¬ 
phemic boundaries, especially when complex sound changes (such as the lenition of 
velars before high vowels or the gemination of unlike consonants to like ones) take 
place. These sound changes obscure the earlier morpheme boundaries, as is clearly 
the case in the examples we have examined above. 

Thirdly, a cautionary note is in order. The experiment reported here does employ 
native speakers in an analytic task. It is possible, as it always is in experimental stud¬ 
ies, that the nature of the task itself is driving the results. In order to shore up the 
conclusions offered here, it is necessary to carry out further and more refined stud¬ 
ies. It might, for example, be useful to exploit more sensitive on-line methods such as 
lexical decision or phoneme monitoring tasks to try to get at the same phenomena. 
So, at this point, while our results are all statistically significant and quite compelling, 
they are still preliminary. 

Yet the question posed in the title of this paper seems to be answered, at least in 
the context of the Japanese adjectival paradigm: synchrony did at one point recapitu¬ 
late diachrony, but restructuring is well underway. And the best evidence is that the 
most revealing analysis of the Japanese adjective paradigm, reflected in the behavior 
of native speakers, is that the historical morpheme boundaries within the suffixes are 
being erased in favor of a more transparent, surface-oriented analysis. 


1 


In this paper, I return to a topic which I first attacked, rather unsuccessfully, some years 
ago (Prideaux 1976 ). I am grateful to Ms. Satomi Currah and Dr. Kaori Kabata for their 
help, as well as to the volunteer Japanese speakers who took part in the experiment. I 
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am also grateful to the anonymous referees who suggested important improvements. This 
research was supported in part by Social Sciences and Humanities Research Council of 
Canada Research Grant 410-96-0349. 

For convenience, I follow the grammatical nomenclature used in Jordan (1963). 

Of course, Table 2 does not exhaust the verbal paradigm, since there are numerous other 
members. It is rather intended to be illustrative of the full paradigm. 

As an anonymous reader pointed out, it might be argued that this is really a concrete 
analysis, not an abstract one, since the speakers might arguably have adopted a completely 
suppletive allomorphy here. This possibility is elaborated briefly below. 
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CATEGORIZING ENGLISH SUPERPOSITION 


Bernard Paul Sypniewski 
Rowan University, Camden Campus 


Superposition is a general theory that does not apply to any particular language 
but applies to all languages. Although it is part of Applicative Universal Grammar 
(AUG) Shaumyan (1987), superposition should interest all linguists because it pro¬ 
vides a framework into which many other theories, such as nominalization and gram- 
maticalization can be placed. By using superposition rather than notions from several 
disparate theories to explain complex linguistic behavior, we can simplify our theo¬ 
ries without any loss of rigor. When seen from the point of view of superposition 
theory, typology will be enriched, extended and made more flexible. This paper is not 
an attempt to convert linguists to AUG. Rather, it is part of a study of a general pro¬ 
cess that underlies a number of previously identified forms of linguistic behavior. One 
need not be an adherent of AUG to use superposition theory. 

Briefly, in superposition, a secondary function 1 of a linguistic expression 2 E is 
overlaid upon its primary function 3 ; Shaumyan (1987), Sypniewski (1997), creating a 
new, bistratal function that allows E to perform some aspects of both primary and 
secondary functions simultaneously. Superposition occurs when the context requires 
it to occur. A superposer operates on a superponend. A superposer is that linguistic 
entity or device that causes a superponend to be superposed. For example, the super¬ 
poser -ing operates upon the superponend turn to produce turning. The superposer 
overlays the secondary function of term on the primary function of predicate, e.g., 
the turning of the seasons. As this paper shows, a superposer may be but need not be 
a portion of a word such as an affix. 

Superposition gives language a simple, flexible, and robust tool for extending the 
lexicon. For example, although -ing in the example above superposes a secondary 
function of term on a primary function of predicate in turning, the same superposer 
may superpose a different function on the same word in a different context, e.g., the 
turning wheel. Superposition operates within a given context and is dependent on that 
context for the result that it produces. Superposition is not the only mechanism used 
to expand the lexicon, of course, but superposition is one of the common mecha¬ 
nisms used to create new expressions. Superposition is conservative; it uses already 
existing linguistic material to form new expressions. Superposition, by itself, does 
not create permanent entries into the lexicon. Superposition occurs only within a 
limited context, i.e., within a single expression. Superposition reoccurs every time 
that the expression is repeated. If a word or expression W is not repeated a sufficient 
number of times by a sufficiently large portion of the linguistic community, W will 
not become part of the permanent lexicon. On the other hand, if an expression meets 
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the needs of the linguistic community, it will be repeated often enough to become 
part of the permanent lexicon 4 . Thus, it is the combination of superposition and rep¬ 
etition that creates new entries into the lexicon. Even when the expression becomes 
part of the permanent lexicon, the superposition can still be seen. Superposition is 
not only implicated in changes in word function but also in changes in word mean¬ 
ing 5 (see Sypniewski 2000). Most of the examples in this paper are examples of func¬ 
tional superposition. However, the semantics of an expression can also superpose an 
expression (ibid). 

Superposition is not just another name for nominalization or grammaticaliza- 
tion. If anything, nominalization and grammaticalization are subsets of the theory of 
superposition. According to AUG, any expression may be superposed, not just nouns 
or verbs. While it is beyond the scope of this paper, it is likely that expressions can 
be superposed to perform functions beyond those of the traditional linguistic cat¬ 
egories. This paper describes the simple, generalized mechanisms used in the pro¬ 
cess of superposition in English. Other languages may not use superposition as freely 
as English or may not use both types of superposition mentioned in this paper to 
the same degree or in the same way as English. There has been some work done on 
nominalization in Thai that may prove useful for the understanding of superposi¬ 
tion Prasithrathsint (1994); Prasithrathsint (1997). New abstract nominal expressions 
have developed in Thai within the last century. Some scholars believe that these new 
nominal expressions result from Thailand’s exposure to the West and the increasing 
familiarity of Thai speakers with English. What is of interest for us is that the new Thai 
nominalizations use material superposers, e.g., /kann/ and /khwaan/ are garmmati- 
calized morphemes or prefixes. They derive from terms meaning ‘affair’ and ‘matter’, 
respectively 6 . Prasithrathsint’s interesting studies demonstrate that semantics (the 
reasons why the nominalizations occurred) of the host language (Thai) expressions 
maybe intended to model the semantics of another language (English) but the super¬ 
positions occurred in a manner appropriate to the grammar of the host language. 

Superposition may allow us to categorize languages. We can create a continuum 
with languages that use only material superposition on one end and ones that use 
only contextual superposition on the other. A language L 7 appears on this continuum 
depending on some measure of the utility of both types of superposition to the speak¬ 
ers of L. This method of grouping languages may provide very different results than 
the traditional ‘family of languages’ since such a continuum would relate languages by 
an aspect of their functional similarity rather than by their phonetic similarity. 

1. material and contextual superposition defined. There are two broad catego¬ 
ries of superposition, which I shall call material superposition and contextual superpo¬ 
sition. Superposers from both categories may operate in tandem in the same linguistic 
context but no individual superposer is a member of both categories. Material super¬ 
position overlays a secondary function onto the primary function of a word 8 W due to 
some change in, addition to, or subtraction from, the structure of W. The sign indicat¬ 
ing the structural change is the superposer 9 . Turning is an example of material super- 
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position because it involves the addition of a suffix to a root; the suffix operates on 
turn to produce turning. In English, suffixes are the most obvious but not the only 
material superposers. Structure refers to any aspect of the morphology, phonology, 
stress, accent, pronunciation, inflection, intonation 10 , or writing of W. The differences 
in the pronunciation of the words minute and arithmetic in (1) serve to superpose the 
secondary function of term modifier onto the primary function of term in the second 
instances. 

(1) a. minute by minute/a minute change 
b. arithmetic/arithmetic function 

Contextual superposition overlays a secondary function upon the primary function 
of W in a linguistic expression E by the change of, addition to, or subtraction from, 
E. W need not be altered at all. In the first instance of Example 1, the word minute 
has been placed before change u , as though it was a term modifier. In the phrase, Zoo 
Run 12 the context imposes a secondary function of term modifier on Zoo and a sec¬ 
ondary function of term on Run. Neither word is altered in any way. The context 13 is 
the superposer in this example. 

Material superposition is internal to W. Contextual superposition is external to 
W. Material superposition affects a single word. Contextual superposition may affect 
more than one word at a time. In our Zoo Run example, one contextual superposer 
affects both words. We may say that the contextual superposer is the placing of the 
word Zoo before the word Run. A context is the relation of each word in a linguistic 
expression to the expression as a whole. Every linguistic expression must have at least 
one word or, in spoken language, one utterance. The one-word sentence Stop! is com¬ 
plete as both a sentence and a linguistic expression. Stop is a predicate. AUG pos¬ 
tulates that all sentences (s-types) must have something that performs a predicate 
function. The tone of voice, if uttered, or the exclamation point, if written, reinforce 
the notion that stop is a command, hence a complete sentence. We may say that the 
context is the fact that stop is the sole word in the sentence or that stop is accompanied 
by strong inflexion and, perhaps, loud tone. The context superposes the secondary 
function of s 14 on stop’s primary function of predicate. 

An expression may use more than one incidence of superposition. In running shoe, 
running is doubly superposed. Running is materially superposed by the suffix -ing, 
overlaying the secondary function of term on run s primary function of predicate. 
Running is contextually superposed by occurring before or being spoken before 
the term shoe. The context overlays the function of term modifier on the already 
superposed running. We may think of all instances of superposition in a linguistic 
expression as occurring simultaneously or as occurring one after the other, like the 
composition of functions in mathematics. In any event, our brains process language 
so quickly that all instances of superposition in an expression are, for all practical 
purposes, simultaneous. 
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2. a brief description of aug’s type system. Words have types because of the func¬ 
tions that they perform in a context 15 . Words have primary, secondary, and even tertiary 
types. The primary type of W is the function that W most frequently performs. Words 
are not the only linguistic objects that have types. An entire expression has a single 
type 16 . For example, what is traditionally thought of as a noun phrase is, for AUG, an 
expression with the type t or term. AUG describes two broad functional categories on 
the genotype level 17 : operators and operands. Operator is a general word that refers to 
a linguistic function. An operand is a linguistic entity on which an operator operates 
(called an operation). Every operation produces a result, called the resultant. 

Every expression is an example of the nucleus-satellite relation. Every expression 
has a nucleus and may have one or more satellites. The nucleus of a nucleus::satellite 
relation is a term for expressions whose type is term and predicate for expressions 
whose type is either predicate or s-type (a sentence). The function of predicate is to 
be the nucleus of a sentence. If a word or expression F functions as the nucleus of 
a sentence, F has the type 18 predicate regardless of whether F has the primary type 
of predicate. A noun phrase 19 has a term as its nucleus. If a word or expression K 
functions as the nucleus of an expression with the type term, K has the type term 
regardless of K’s primary type. A term serves the function of being an operand for a 
predicate, and, in that sense, is the satellite of a predicate. From this discussion, we see 
that if F does not have the primary type of predicate, F is superposed to predicate. If K 
does not have the primary type of term, then K is superposed to the type term. Mate¬ 
rial superposition, therefore, operates by changing the nucleus-satellite relation. 

(2) The Zoo Run raised $10,000 for the Philadelphia zoo. 

In (2), the first instance of the word zoo is superposed by being placed in a satellite 
relation with the word run. Normally, zoo is a term and, therefore, the nucleus of an 
expression with the type term. Zoo is superposed because it is the satellite 20 of run. Run 
is superposed because it is the nucleus not of a sentence but of a term. The word Phila¬ 
delphia is contextually superposed because it is the satellite of the second instance of 
zoo. (2) demonstrates that superposition is ‘local’, i.e., superposition operates within 
a narrow context. While the first instance of zoo in Example 2 is superposed, the 
second instance of zoo is not affected by the superposition. For the purposes of this 
discussion, we will assume that ‘context’ means ‘nucleus::satellite relation. Superposi¬ 
tion operates on expressions rather than words even if the expression consists of one 
word. Several instances of superposition may occur at once, e.g., Zoo Run or running 
shoe. AUG claims that the types of words and expressions are dynamic. Most linguistic 
theories assume that the types of words and expressions are static. 

Some words seem easier to superpose than others are. A good example is the word 
run. Some examples of the superposition of run are found in Table 1. 

Functional superposition (Sypniewski 1997) often affects the semantics of a lin¬ 
guistic expression and vice versa (ibid 2000), as can be seen in Table 2 and in the 
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1 

The boy ran to the store. 

No superposition 

2 

The running shoe 

Material superposition (-ing) and contextual superposition 

3 

The Battle of Bull Run 

Contextual superposition 

4 

A run in her stocking 

Contextual superposition 

5 

No hits, no runs, no errors 

Contextual superposition (and material superposition (-s)? 

6 

The runner 

Material superposition (-er) (and contextual superposition?) 

7 

Who runs London? 

Semantically superposed but not functionally superposed 


Table 1. 


1 

running shoe 

running joke 

2 

run of the mill 

run in her stocking 

3 

the runner on the field 

the runner on the steps 

4 

running the government 

running the sheep 

5 

the engine is running 

my nose is running 


Table 2. 

seventh example in Table 1. While structurally similar, are these pairs of expressions 
semantically the same? 

Semantic superposition complicates the theory of superposition. Semantic super¬ 
position is not a completely worked out theory but, at this stage, it is sufficiently 
well-developed to say that the general theory of superposition cannot be under¬ 
stood without taking semantic effects into consideration. Table 1 (except for #1) con¬ 
tains examples of functional superposition. In Table 2, there is no functional change 
between members of the same pair but sometimes the semantic difference is substan¬ 
tial, as in #3 and #4. In #3, the first phrase refers to a person and the second phrase 
refers to an object (a rug of some sort). In #4, the first phrase refers to operation; the 
second phrase refers to physical movement or managing physical movement. 

3. uses of superposition. We may see the uses to which English speakers put super¬ 
position by examining some difficult-to-superpose words. A good example is antihis¬ 
tamine. Antihistamine is a term; the word refers to a class of drugs. All terms are easy 21 
to superpose to term modifiers. Antihistamine is no exception: antihistamine tablet. 
It may be difficult to see how antihistamine can be superposed any further. Unlike 
run, which can be easily superposed to a term, antihistamine does not describe any 
activity or action 22 that would allow us to easily superpose it to a predicate. However, 
someone might say: The doctor antihistamined me. This is certainly not an ordinary 
use of antihistamine but the sentence is understandable, perhaps requiring a moment 
to grasp its meaning 23 . The sample sentence can refer to a doctor’s providing or sug¬ 
gesting antihistamines to a patient for some ailment. I do not suggest that this use will 
ever become an accepted part of the general lexicon, but the sample sentence demon¬ 
strates that superposition may be used in the creation of new words or phrases. Here 
are four more examples of similar linguistic behavior: 
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(3) a. New Jersey Network is proud to partner with the Division of Youth and 
Family Services. 

b. Kinko’s: its a new way to office. 

c. We repurposed the data 

d. XYZ Corporation took over ABC Corporation in a leveraged buyout. 

In Example 3, terms ( partner, office, (re)purpose, and leverage ) are used as predicates 
or, in the case of leverage, contain aspects of the predicate in their new roles. Super¬ 
position frequently occurs in the creation of new words or phrases. A language group 
often creates new words out of compounds of old words. The lexicon greatly expands 
by the compounding of old words into new words or expressions that are used as one 
word. Common animal and plant names form a large group of words made in this 
way. Common or ‘folk’ names are generally deprecated by the scientific community, 
yet they show a remarkable tendency toward describing the community’s idea of the 
important characteristics of a plant or animal. The modifier horse in (4)e and (5)e 
refers to the large size of the chestnut or reed or, more specifically, to their flowers or 
foliage. The skunk in skunk cabbage, U)d, refers to the delightful aroma of the plant. 


(4) a. Lightning bug 

b. Gypsy moth 

c. Honey Bee 

d. Skunk cabbage 

e. Horse chestnut 


(5) a'. Stink bug 
b'. Luna moth 
c'. Carpenter Bee 
d'. Skunk 
e'. Horse tail 


(4) can be extended almost indefinitely. Superficially, the members of (4) can be 
thought of as two-word phrases, but each entry is one name. Substitute or remove a 
word and a different animal or plant comes to mind, as in (4). All of the examples in 
(3) and (4), except for (4)d, are examples of contextual superposition. (3)a and b also 
contain examples of material superposition. Many common words are, in reality, two 
words used as one, e.g. afternoon, alleyway, basketball, hatbox, and pitchfork. 

I extracted 24 a subset of 250 compound words from a corpus of about 1500 exam¬ 
ples of superposition. In building my corpus of superposition examples, I have tried 
to get a random selection of authors and types of texts so as to minimize the impact 
that the peculiarities of any authors or text types might have on the conclusions that I 
might draw. By considering each compound word as two words that happened to be 
joined together, I examined the type of the first word, the type of the second word, 
and the resultant type. If this small sample is typical of compound words, superposi¬ 
tion is heavily skewed towards the use of and production of terms. This finding is not 
unexpected since we use language to describe and symbolically manipulate the world. 
Terms allow us to refer to parts of our experience in a sort of shorthand. Terms 
are easy to superpose into term modifiers, e.g., ship (term), ship mate (term modi¬ 
fier), and frequently into predicates, e.g., to ship (predicate) 25 . Superposition allows 
us to group concepts. We can use superposers to describe an object O, some aspect 
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of the existence of O, and some action performed with or to O without substantially 
expanding the lexicon. In this sense, superposition is a conservative mechanism that, 
on one hand, increases the scope of the lexicon by increasing the flexibility of expres¬ 
sions, and, on the other hand, keeps the size of the lexicon manageable. 

In some types of writing, superposition occurs quite frequently. One text that I 
selected was Gilbert Waldbauer’s Insects Throughout the Seasons. I isolated a number 
of examples of compound nouns. In one fifty page segment of Waldbauer's book, 
I counted 213 compound words that resulted from superposition. At an average 
of 370 words per page, superposed compound words amounted to 1.32% of the 
text (213/18500). One may argue that compound words like seahorse or highrise are 
individual words and not, therefore, superposed. One must, however, acknowledge 
that there is no difference between sea horse and seahorse or high rise and highrise. 
At one time, these words were superposed. Here, the superposer is the context that 
associates two words as parts of a single name. In each case, as in (4) and (5), a change 
in either part of the compound changes the semantics of the entire expression. That 
many compound words are spelled as one word rather than as two is merely the 
recognition of their use as a single name. These examples, and from experience 
we can think of hundreds more like them, demonstrate the role of superposition 
in expanding the lexicon. If a superposed expression E results in E s and E. proves 
useful to the linguistic community, E s may be adopted as a word or expression in 
its own right. Will office become a predicate as in (6)? I have not heard it used 
as a predicate except for the one commercial mentioned. What about partner or 
repurpose ? Certainly, they are used more often. Many words go in and out of fashion. 
Fashion or necessity is determined by the linguistic community. Superposition is the 
mechanism that allows the free creation of new, temporary 26 , expressions. It is often 
easier to ‘extend’ a meaning than to create a new expression from whole cloth. We 
often hear the results of superposition in 

(6) a. slang, e.g., offed (slang for killed), witch hunt, a done deal 

b. technological jargon, e.g., to upgrade software, disk cache, object oriented 
programming 

c. fashion, e.g., stiletto heels, tailored twills,flirty cocktail dress 

d. entertainment, e.g., blonde bombshell, blockbuster 27 , hiphop 

e. sports, e.g., a three-pointer, round-robin, dugout, touchdown, round-tripper, 
shot put, fly out 

f. publishing, e.g., book jacket blurbs, Martha Stewart Living 
The sentences in (7) were culled from recent publications: 

(7) a. Looks like were going for a kind of‘Partridge Family’ thing 28 . 

b. “I hate to shiver”, says merchant Cathy, “so I found a rugby weight turtle 
that’s as cozy as a cup of cocoa. 29 ” 

c. Soaring island real estate triggers reassessment 30 . 
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(7) d. To be very rich or very lovely or very bad or very sure or very good is 

inherently interesting, since veryness of any kind is not part of dailyness 31 . 

e. He really knows food but he’s not a snootie foodie 32 . 

f. SQL lets you combine as many expressions as needed to form your where 
clause 33 . 

3. conclusion. As Example 4 clearly demonstrates, English expression creation is 
very flexible. The flexibility comes from superposition. None of the expressions in 
the examples in this paper are self-conscious. Superposition is not something that 
is learned. Superposition describes a real, underlying mechanism in speech. In some 
contexts, e.g., (7)0, superposition is so common that the expression may not be able to 
be stated otherwise or with such efficiency. 

This paper has described the two broad categories of superposition in English 
speech. However, I have not accounted for the role that gestures, other verbal cues, 
or contexts other than traditional linguistic contexts might play in superposition. It 
is possible that there are other categories of superposition besides those mentioned 
here. 


‘Function’ is usually taken to mean syntactic or grammatical function. However, recent 
work suggests that ‘function might also have semantic aspects. 

For the purposes of this paper, a linguistic expression means one or more words in a context. 
AUG is a functional theory of grammar. Word or expression types are considered to be 
functions. For example, what most linguists call a ‘noun’ is, in AUG, a term, a ‘verb’ is a 
predicate, and an ‘adjective’ is a term modifier. 

There is almost linguistic Darwinism here. Words or expressions that are ‘fit’ are repeated 
and become part of the permanent lexicon. Words or expressions that are not ‘fit’ are not 
repeated and, therefore, do not survive. 

When I mention ‘word meaning’ in this paper, I am not suggesting that words are the car¬ 
riers of meaning. This position has been discredited. I merely wish to use a shorthand 
expression to describe the semantic properties or processes generated when a word or 
expression is uttered. 

Prasithrathsint, personal correspondence. 

It is theoretically possible that a language L exists such that L does not use any superposi¬ 
tion. While I have not done an exhaustive survey of the superposition use of all languages 
(I don’t know how one could, given the linguistic problems involved and the duration of 
the human life span), I would be greatly surprised to find one. 

I will not define word in this paper. The reader may use the broadest definition of word. 
The reader should understand, however, that the definition of word has a number of impli¬ 
cations for the theory of superposition and AUG in general. Those implications will not 
be discussed in this paper. 

A material superposer changes the physical characteristic of a word. The superposer oper¬ 
ates upon the word, which is its superponend, to produce a resultant, which is a new word 
or expression. A superposer is always a sign. AUG is a semiotic theory of language. 
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10 I will not discuss whether superposition occurs when only the semantics of a word or 
expression are modified as, for example, in the various senses of Oh, that’s really nice 
caused by stress or intonation. 

11 Depending on one’s definition of word, one may say that change is also contextually super¬ 
posed if one thinks that the primary function of change is predicate. 

12 The Zoo Run was the name of a jogging event used to raise money for the Philadelphia 
Zoo. 

13 I will not discuss the thorny question of whether context is a sign. 

14 In AUG, if an expression functions as a sentence, the expression has type s. 

15 To say this another way, words have no inherent type. They get their type, i.e., their func¬ 
tion, from the context that they are in. The notions of primary, secondary, tertiary, etc. 
types is a concession to the obvious fact that, through use, we become familiar with a word 
W performing a function F more often than it performs the function F'. 

16 Expressions and sentences get their types through a process of reduction. Reduction will 
not be discussed further in this paper. 

17 AUG claims that languages exist on two levels: the genotype and the phenotype. The geno¬ 
type level is the most abstract level of language. The genotype level is the same for all 
languages. The phenotype level is the language-specific expression of the genotype. Super¬ 
position occurs on the genotype level of language. 

18 AUG describes one-, two-, and three-place predicates, having the notation Ots, OtOts, and 
OtOtOts, respectively. In this paper, I use predicate to mean any of the three types just 
mentioned. 

19 AUG does not recognize noun phrases, etc, per se. A noun phrase is just another term. In 
the discussion, I use noun phrase to distinguish between a term that is composed of more 
than one word and a term that consists of one word only. 

20 The complete context or nucleus::satellite relation is The zoo run. The word the is non- 
superposible and creates the expectation that what follows it has the type term. 

21 Terms are so easy to superpose into term modifiers that it is often difficult to say whether 
a term modifier is superposed to a term or a term is superposed to a term modifier. 

22 Antihistamine may, perhaps, be thought of as a type of medication that causes an action 
(or prevents an action). However, it does not refer to the action itself. 

23 I think that the pause referred to is an important linguistic fact. However, I will not discuss 
time in the study of language in this paper. 

24 The study presented here is preliminary. I will not describe my methodology or corpus in 
detail. The work is continuing. 

25 These last sentences should not be read to suggest that all terms can easily be superposed 
into term modifiers or predicates. A simple example is the word desk. Some words do not 
seem to have more than one type, e.g., whopping, or are used so frequently to perform their 
primary function that we may consider them to be nonsuperposible. (Sypniewski 2000) 

26 Repetition turns a temporary expression into a ‘permanent’ one, i.e., an enduring part of 
the lexicon. 

27 Block busting in the context of illegal, discriminatory activity means something entirely 
different. This clearly demonstrates the role of superposition in semantic context. 

28 Mankiewicz, John. November 15,1999. Stock Market Follies. New Yorker. 

29 Land’s End catalogue, Winter, 1999 
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30 headline. Cape May County Herald. October 27,1999. 

31 Gopnick, Adam. August 9,1999, ‘The Stuff of Fame’, Comments. New Yorker. 

32 Schellinger, L. November 15,1999. Ming is Rising. New Yorker. 

33 Medinets, David. 2000. PHP 3 programming browser-based applications. New York: 
McGraw-Hill (page 134) 
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ITERATION AND GENRE IN A 19 TH-CENTURY NOVEL 


Michael Cummings 
York University, Toronto 


Among many celebrated minor Dickens characters is Mr. Micawber of David 
Copperfield, appreciated by the general public as a man for whom something must 
assuredly turn up, although he seems forever destined for further disappointment. 
He is less generally remembered as a stylist, although the following excerpt from a 
personal letter so distinguishes him: 

(1) ‘my dear sir, 

‘Circumstances beyond my individual control have, for a considerable lapse 
of time, effected a severance of that intimacy which, in the limited oppor¬ 
tunities conceded to me in the midst of my professional duties, of contem¬ 
plating the scenes and events of the past, tinged by the prismatic hues of 
memory, has ever afforded me, as it ever must continue to afford, gratifying 
emotions of no common description. This fact, my dear sir, combined with 
the distinguished elevation to which your talents have raised you, deters 
me from presuming to aspire to the liberty of addressing the companion 
of my youth, by the familiar appellation of Copperfield! It is sufficient to 
know that the name to which I do myself the honour to refer, will ever 
be treasured among the muniments of our house (I allude to the archives 
connected with our former lodgers, preserved by Mrs. Micawber), with 
sentiments of personal esteem amounting to affection. (Project Gutenberg 
1996a: Ch. 49) 

Both lexical and syntactical aspects of this extraordinary style have a lot to do with 
Latinism, but there is also a distinguishing textual element, which is a very low inci¬ 
dence of anaphoric co-reference and co-classification (Halliday & Hasan 1985:73-75), 
that is, little referential iteration, i.e., iteration of the same semantic referent. There is 
little iteration of the same lexical item, and there is little semantic iteration with pro¬ 
forms, beyond a number of ist-person references to the writer himself. This aspect of 
style could usefully be contrasted with the opposite characteristic in opposite person¬ 
alities within the same work: 

(2) ‘I’ll tell you what I am,’ whispered Mr. Creakle, letting it go at last, with 
a screw at parting that brought the water into my eyes. ‘I’m a Tartar.’ 

‘A Tartar,’ said the man with the wooden leg. 
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‘When I say I’ll do a thing, I do it,’ said Mr. Creakle; ‘and when I say I will 
have a thing done, I will have it done.’ 

‘—Will have a thing done, I will have it done,’ repeated the man with 
the wooden leg. 

‘I am a determined character,’ said Mr. Creakle. ‘That’s what I am. I do my 
duty. That’s what I do. My flesh and blood’—he looked at Mrs. Creakle as he 
said this—‘when it rises against me, is not my flesh and blood. I discard it. 
Has that fellow’—to the man with the wooden leg—‘been here again?’ 

‘No,’ was the answer. 

‘No,’ said Mr. Creakle. ‘He knows better. He knows me. Let him keep 
away. I say let him keep away,’ said Mr. Creakle, striking his hand upon the 
table, and looking at Mrs. Creakle,‘for he knows me. Now you have begun 
to know me too, my young friend, and you may go. Take him away.’ 

(Project Gutenberg 1996a: Ch. 6) 

Mr. Creakle and the man with the wooden leg both suffer from an extreme fondness 
for referential iteration, which they make to be a sinister symbol of oppression and 
implacability. 

The kind of referential iteration which concerns this paper is the iteration of a ref¬ 
erence first introduced in lexical form—thus ‘lexical iteration’, which may occur in the 
form of recurring lexis, or as a pro-form. Micawber speech and prose has a very low 
iterative density, and Creakle speech an extraordinarily high one. Iterative density, 
in similar fashion to lexical density (Halliday 1994:350-52), can be quantified as the 
number of iterations through a designated stretch of text. Different parts of the same 
text can be shown to have different iterative densities by comparing the number of 
lexical iterations within stretches of similar length. 

1. a method for measuring iterative density. In a previous paper, I have described 
a computer methodology for measuring the change in iterative density continuously 
through a long text like Copperfield (Cummings 1998). The Prolog language software 
reads the input text character by character and reconstitutes words as Prolog lists 
(Clocksin & Mellish 1984:4^.) by taking spaces and punctuation as boundary mark¬ 
ers. Word lists for lexical words are accumulated in a span list of a size set by the 
operator at run time. Each such word is parsed to distinguish lexical morphemes from 
grammatical, and each successive lexical item is compared with the previous items in 
the span to determine any iteration. Additionally, various pro-forms are recognized as 
such and reckoned also to be referential iteration. All words are output again as con¬ 
tinuous text, with original spacing and punctuation; but each referential iteration is 
marked with asterisk. At the same time the density of iterations is measured by count¬ 
ing the number of iterations recognized within each successive text segment contain¬ 
ing 100 formal items which are potentially iterative, i.e., lexical and pro-form. The end 
of each such segment is marked in the output text with the serial number of the seg¬ 
ment and a numerical figure for the iterative density of that segment. 
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This method has limitations which are reflected in the output results. The com¬ 
puter doesn’t know that lexical items are not always realized as single words. Neither 
does it recognize ellipsis, the distinction between pronominal and dummy subject 
‘it’, or that between deictic and dummy subject ‘there’. The parsing of lexical and 
grammatical morphemes depends on a very limited algorithm which doesn’t always 
correctly manage the plural of nouns and some other sorts of morphotactics. The seg¬ 
ments of text measured for density are exclusive, not continuously overlapping, so 
that the segmentation of the text is a kind of sampling. 

Although the main subject of this paper is not Copperfield, but a much shorter and 
therefore more manageable Dickens novel, Oliver Twist (Project Gutenberg 1996b), 
the computer results can be illustrated with an output segment from the processing 
of Copperfield which constitutes the analysis of the Creakle passage (with the lexical 
word iteration span set for 30; Cummings 1998:258 and nt. 2): 

(3) ‘T’ll [180-48] tell you what *i am,’ whispered mr. Treakle, letting Tt go at 
last, with a screw at parting that brought the water into *my eyes. 

‘*i’*m a tartar.’ 

‘a * tartar,’ said the man with the wooden leg. 

‘when *i say T’Tl do a thing, *i do Tt,’ said mr. Treakle; ‘and when 
T *say *i will have a Thing done, T will have Tt *done.’ 

‘- will have a Thing Tone, *i will have Tt *done,’ repeated the *man 
with the *wooden Teg. 

‘T am a determined character,’ said mr. Treakle. ‘that’s what T 
am. T do my duty, that’s what T do. *my flesh and blood’ - Tie 
looked at mrs. Treakle as *he said this - ‘when Tt rises against Tne, 
is not Tny Tlesh and Til 00 d. T discard Tt. has that fellow’ - to 
the Tnan with the Tvooden Teg -’been here again?’ 

‘no,’ was the answer. 

‘no,’ said mr. Treakle. ‘*he knows better. *he *knows *me. let *him 
keep away. T say Tet *him *keep away,’ said mr. Treakle, striking 
Tiis hand upon the table, and looking at mrs. Treakle,‘for Tie *knows 
[181-62] *me. now you have begun to *know Tue too, Tny young friend, 
and Tou 

This selection from the output includes all of segment #181, which has a density of 
62 recognized iterations, each of which is preceded by an asterisk. By comparison, 
the segment which contains most of the above quotation from Mr. Micawber’s letter 
(#1554), contains only 23 recognized iterations, 14 of them within the text quoted. 
Examination of the output segment reveals the effect of some of the limitations in 
the computer method. The computer has been instructed that ‘am’ and ‘will’ are to be 
regarded as function words, not lexical items, but has not been instructed that ‘“m” 
and ‘“11” are variants whose repetition should not be marked. The computer has been 
instructed that ‘do’ is more probably a function word, and so misses the iterations of 
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1:1-25 2-15 3-15 4-34 5-32 6-20 7-27 8-26 9-18 10-19 11-16 12-38 13-38 14-35 15-30 16-37 
17-25 . . . 862-23 863-25 864-23 865-21 866-17 867-24 868-24 

II: 8-321 8-794 n-322 11-814 12-329 12-594 13-45 13-173 13-320 13-599 13-795 13-854 
14-215 ... 43-66 43-95 43-210 44-636 44-718 45-161 45-731 47-644 47-749 48-807 
49-377 

III: 32-320 36-319 40-793 42-794 44-513 47-597 49-249 51-723 52-70 52-388 52-693 
53-9 53-172 • • • 116-642 116-705 116-747 118-635 118-704 119-730 119-748 


Figure 1. Condensed logs of segment densities in Oliver Twist 
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Figure 2. Frequency distribution graph for segment densities in Oliver Twist. 
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lexical ‘do’ in this passage. The computer has also been told that ‘that’ is more likely a 
modifier than a pro-form, and so it misses the use of‘that’ when it is a pro-form. The 
results are therefore heuristic rather than absolute; in this case the computer method 
has actually underestimated the degree of referential iteration. 

The purpose of this paper is to show how with the help of the computer, however 
imperfect, the pattern of iterative densities in a long text can be correlated with other 
stylistic variables, specifically genre variables (Eggins 1994:25-48; Martin i992:546ff). 
Another kind of output produced by the computer is very useful for this purpose: a 
log of the densities in each segment. This takes three forms, which are illustrated in 
Figure 1, for Oliver Twist. The first type of log is a list of all the densities recorded for 
the 868 segments of Twist in serial order. The second type is the same list ordered 
from the lowest densities to the highest, with the order of serial number and density 
figure reversed. The third type is like the second, but each pair of figures represents an 
accumulated density for three consecutive segments ending with the segment desig¬ 
nated by the serial number. 

Yet another computer output is represented in Figure 2, a frequency distribution 
graph for the various densities. In this graph, each density, from the lowest to the 
highest, and the number of times it occurs are shown in the left-hand column. 
The purpose of these secondary outputs is to enable the identification and compari¬ 
son of segments with similar degrees of referential iteration, with a view to making 
stylistic statements about genre variables instantiated in such segments. 

2. iteration and genre. In order to make stylistic statements about segments of 
text from Oliver Twist whose iterative densities have been measured, it is necessary 
to characterize these segments as belonging to one genre or another. The whole 
text of Twist can be described in terms of just five elementary genre categories, narra¬ 
tive, descriptive, expository, monologue, and dialogue. The most elementary features of 
these categories can be illustrated in examples of no more than two or three consecu¬ 
tive sentences each. Narrative, as in example (4), has the feature sequence of events 
in time. Descriptive, as in example (5), lacks sequence of events in time, but shares 
with narrative the feature visualizable scene. Example (6) represents expository genre, 
with the feature abstract facts and evaluations (on the part of the novelistic narra¬ 
tor). Example (7) represents monologue, being an extended quoted address by one 
fictional character. Dialogue, as in example (8), has the feature quoted address by 
two or more characters to one another. 

(4) Narrative 

as They approached the city, the noise and traffic gradually increased; 
when "They threaded the streets between shoreditch and smithfield, *it 
had swelled into a roar of sound and bustle. 
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(5) Descriptive 

"it was market-morning, the ground was covered, nearly ankle-deep, with 
filth and mire; a thick steam, perpetually rising from the reeking bodies 
of the cattle, and mingling with the fog, "'which seemd to rest upon the 
chimney-tops, hung heavily above, all the pens in the centre of the large 
area, and as many temporary "pens as could be crowded into the vacant 
space, were filled with sheep; tied up to posts by the gutter side were long 
lines of beasts and oxen, three or four deep. 

(6) Expository 

i need [172-20] hardly beg the reader to observe, that this action should 
tend to exalt "them in the opinion of all public and patriotic men, in 
almost as great a degree as this strong proof of "their anxiety for "their 
own preservation and safety goes to corroborate and confirm the little 
code of laws "which certain profound and sound-judging philosophers 
have laid down as the main-springs of all nature’s deeds and actions ... 

(7) Monologue 

‘bill, dear "bill, "you cannot have the heart to kill "me. oh! think of all 
"i have given up, only this one "night, for "you. "you shall have time 
to "think, and save yourself this crime; "i will not loose "my hold, "you 
"cannot throw "me off. 

(8) Dialogue 

‘I mean to say that "i shouldn’t,’ "replied "tom, angrily, ‘"there, now. ah! 
"who’ll "say as much as that, "i should like to know; eh, "fagin?’ 

nobody, "my "dear,’ "replied the "jew; not a soul, "tom. "i don’t "know 
one of 'em that would do "it besides you; not one of‘"em, "my "dear.’ 

In order to evaluate the whole output for correlations between iterative density 
and genre variation, the 868 segments in the output file were reordered in a word 
processor by density from lowest to highest. The resulting arrangement corresponds 
to the pattern represented by the second type of log in Figure 1 and in the frequency 
distribution graph of Figure 2. Then four samples, each of 30 successive segments (32 
in Sample 4), were taken from the resulting file. The location of these samples along 
the density axis is represented in Table 1. 

The selection of segments to sample is thus based on a compromise between the 
distribution of densities and the numbers of segments. In terms of densities, the gap 
between samples is fairly constant, at 7 or 8. In terms of the number of segments, 
the first and last samples mark the ends of the file, the second and third samples are 
equally spaced from the ends and from the centre. 

After the samples were isolated, each of the segments in the samples was coded for 
genre type. When a segment contained a boundary between two genres, the segment 
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Sample 

Segments 

No. of Segments 

Range of Densities 

Sample 1 

1-30 

30 

8-15 

not in samples 

31-209 

179 

15-23 

Sample 2 

210-239 

30 

23 

not in samples 

240-627 

388 

23-31 

Sample 3 

628-657 

30 

31-32 

not in samples 

658-836 

179 

32-39 

Sample 4 

837-868 

32 

39-49 


Table 1. Sampling of segments measured for iterative density. 


was labelled with the genre name that represented a clear preponderance of text. 
If there was no clear preponderance, then the segment was labelled mixed. Addi¬ 
tionally, segments which did not belong to dialogue or mixed genre were labelled 
as either concentrated or diffuse. A segment is concentrated if it focuses on just one 
or two concrete (i.e., not abstract) participants, and diffuse if it focuses on more than 
two. The formal criterion for focussing on concrete participants is the occurrence of 
such participants as thematic Subject elements in independent clauses. (A thematic 
Subject is a grammatical Subject which is not inverted to a post-main-verb position; 
cf. Berry 1996.) Dialogue genre was reckoned to focus inherently on two concrete 
participants, the parties to the dialogue, and thus to be inherently concentrated. 

The data indicate that segments of whatever genre coded as diffuse tend to have 
low iterative density, and segments regarded as concentrated tend to high density. 
Good illustrations are provided by examples (9) and (10), diffuse descriptive and con¬ 
centrated monologue segments respectively. Example (9) with an iterative density 
of 8 has both narrative and descriptive elements, with descriptive clearly prevailing. 
It is diffuse insofar as thematic Subjects of independent clauses include ‘Mr. Sikes’, 
‘The ground’, ‘a thick steam and several others. Example (10) with density of 41 has 
narrative, monologue, and dialogue elements, with monologue prevailing. It is con¬ 
centrated in the sense that thematic Subjects of independent non-dialogue clauses 
include only Nancy and Sikes. 

(9) Description, diffuse 

square, mr. sikes struck, by way of chiswell [320-13] "'street, into barbican: 
thence into long lane, and so into smithfield; from "which latter place 
arose a tumult of discordant sounds that filled Oliver twist with amaze¬ 
ment. 

"it was market-morning, the ground was covered, nearly ankle-deep, 
with filth and mire; a thick steam, perpetually rising from the reeking 
bodies of the cattle, and mingling with the fog, "which seemd to rest 
upon the chimney-tops, hung heavily above, all the pens in the centre of 
the large area, and as many temporary "pens as could be crowded into 
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the vacant space, were filled with sheep; tied up to posts by the gutter 
side were long lines of beasts and oxen, three or four deep, countrymen, 
butchers, drovers, hawkers, boys, thieves, idlers, and vagabonds of every 
low grade, were mingled together in a mass; the whistling of ’''drovers, 
the barking dogs, the bellowing and plunging of the "oxen, the bleating 
of sheep, the grunting and squeaking of pigs, the cries of "hawkers, the 
shouts, oaths, and quarrelling on all sides; the ringing of bells and roar of 
voices, that issued from every public-house; [321-8] the crowding, 

(10) Monologue, concentrated 

mortal fear,--’i- [752-28] -"i won’t scream or cry--not once--hear me— 
speak to "me--tell "me what "i have done!’ 

‘you know, "you "she devil!’ returned the robber, suppressing "his 
breath, ‘"you were watched to-night; every word "you said was heard.’ 

‘then spare "my life for the love of heaven, as "i "spared yours,’ rejoined 
the girl, clinging to "him. ‘bill, dear "bill, "you cannot have the heart to kill 
"me. oh! think of all "i have given up, only this one "night, for "you. "you 
shall have time to "think, and save yourself this crime; "i will not loose 
"my hold, "you "cannot throw "me off. "bill, "bill, for "dear god’s sake, for 
your own, for mine, stop before "you spill "my blood! "i have been true to 
"you, upon "my guilty soul "i have!’ 

the man struggled violently, to release "his arms; but those of the girl 
were clasped round "his, and tear "her as "he would, "he could not "tear 
[753-4 1 ] "them away. 

That diffuse tends to low density and concentrated to high density is indicated 
in Table 2, which represents the distribution of genre types and concentration of 
focus across all the samples. The number of diffuse segments drops off rapidly after 
Sample 1. The number of segments labelled as concentrated increases slightly through 
Samples 1-3; but the combined number of segments labelled concentrated and dia¬ 
logue segments grows rapidly across the samples. Other genre correlations revealed 
by Table 2 are the tendency for narrative and descriptive genre to show low iterative 
density, whereas monologue and especially dialogue show a preference for high itera¬ 
tive density. 

3. types of narrative. Not surprisingly, Table 2 indicates that, after dialogue genre, 
the vast majority of unmixed segments are the narrative ones. Very characteristic of 
narrative segments with the lowest iterative densities (Sample 1) is a subtype which 
depicts a continuously advancing point of view, as on a journey. In example (11) the 
point of view belongs to Sikes and Oliver on a foot journey into the heart of London. 
The movement here is particularly complex because along two axes simultaneously, 
time and space; the new phenomena encountered increase in variety and number 
along both, and naturally make for a very limited number of iterations. 
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Genre: 

Sample 1 

Sample 2 

Sample 3 

Sample 4 

TOTALS 


No. 

% 

No. 

% 

No. 

% 

No. 

% 

No. 

% 

Narrative 

19 

15.6 

15 

12.3 

8 

6.6 



42 

34-4 

Descriptive 

5 

4-1 



1 

0.8 



6 

4-9 

Expository 

1 

0.8 







1 

0.8 

Monologic 

1 

0.8 

2 

1.6 

2 

1.6 

5 

4-1 

10 

8.2 

Dialogic 



6 

4-9 

15 

12.3 

24 

19-7 

45 

36.9 

Mixed 

4 

3-3 

7 

5-7 

4 

3-3 

3 

2-5 

18 

14.8 

TOTALS 

30 

24.6 

30 

24.6 

30 

24.6 

32 

26.2 

122 

100 


Concentrat. 

4 

3-3 

5 

4-1 

6 

4-9 

1 

0.8 

16 

13.1 

Diffuse 

22 

18.0 

12 

9.8 

5 

4-1 

4 

3-3 

43 

35-2 

TOTALS 

26 

21.3 

17 

13.9 

11 

9.0 

5 

4-1 

59 

48.3 


Table 2. Distribution of genre types 

(11) Narrative, diffuse. 

wrong side of the road, [319-15] had endangered "his arriving at the office, 
a quarter of a minute after "his time, the public-houses, with gas-lights 
burning inside, were already open, by degrees, other shops began to be 
unclosed, and a few scattered people were met with, then, came straggling 
groups of labourers going to "'their work; then, men and women with fish- 
baskets on "their heads; donkey-carts laden with vegetables; chaise-"carts 
filled with live-stock or whole carcasses of meat; milk-"women with pails; 
an unbroken concourse of people, trudging out with various supplies to 
the eastern suburbs of the town, as "they approached the city, the noise 
and traffic gradually increased; when "they threaded the streets between 
shoreditch and smithfield, "it had swelled into a roar of sound and bustle, 
"it was as light as "it was likely to be, till night came on again, and the busy 
morning of half the london population had begun. 

turning down sun "street and crown "street, and crossing finsbury 
square, mr. sikes struck, by way of chiswell [320-13] "street, into 

Narrative segments in the next band of iterative densities (Sample 2) show less 
of this subtype, proportionally more of a more static kind of narrative. In example 
(12) the point of view starts with Oliver, moves to the things he looks at, then moves 
back to and remains with Oliver. The limiting of attention to a single character and 
his immediate surroundings makes for repeated mention of the same limited set of 
participants, and thus for a higher iterative density. 

(12) Narrative, diffuse. 

candle was burning low in the socket when "he [99-23] rose to "his feet, 
having gazed cautiously round "him, and listened intently, "he gently 
undid the fastenings of the door, and looked abroad. 
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"it was a cold, dark night, the stars seemed, to the boy’s eyes, farther 
from the earth than "he had ever seen "them before; ’there was no wind; 
and the sombre shadows thrown by the trees upon the ground, ’''looked 
sepulchral and death-like, from being so still. "he softly reclosed the 
Moor, having availed "himself of the expiring light of the candle to tie up 
in a handkerchief the few articles of wearing apparel "he had, sat "himself 
down upon a bench, to wait for morning. 

with the first ray of "light that struggled through the crevices in 
the shutters, Oliver arose, and again unbarred the "door, one timid look 
around--one moment’s pause of hesitation--"he had closed "it behind 
"him, and was in the open street. 

"he "looked to the right and to the left, uncertain whither to fly. 

"he remembered to have seen the waggons, as "they went out, toiling 
up the hill, "he took the same route; [100-23] and arriving at a footpath 

A third and even more intriguing narrative style approaches the level of iterative 
density characteristic of dialogue genre. This is a sequence of events describing char¬ 
acters who communicate with one another in an unspoken dialogue. In example (13) 
Nancy, Sikes, Fagin, and even the dog communicate among themselves, and even to 
themselves, by gesture and by the reported event of dialogue. This makes for a great 
many iterative references to the same set of participants, as in dialogue proper. 

(13) Narrative, diffuse. 

and a half closing of "his deeply- [222-23] set "eyes, warned miss "nancy 
that "she was "disposed to be too communicative, is not a matter of much 
importance, the fact is all we need care for here; and the "fact is, that 
"she suddenly checked "herself, and with several gracious smiles upon mr. 
sikes, turned the conversation to other "matters, in about ten minutes’ 
time, mr. fagin was seized with a fit of coughing; upon "which "nancy 
pulled "her shawl over "her shoulders, and declared "it was "time to go. 
mr. "sikes, finding that "he was walking a short part of "her way "himself, 
expressed "his intention of accompanying "her; "they went away together, 
followed, at a little distant, by the dog, "who slunk out of a back-yard as 
soon as "his master was out of sight. 

the jew thrust "his head out of the room door when sikes had left 
"it; looked after "him as we walked up the dark passage; shook "his 
clenched fist; muttered a deep curse; and then, with a horrible grin, 
reseated "himself at the table; "where "he was soon "deeply absorbed 
[223-31] in the interesting pages of the hue-and-cry. 

4. a note on the narrator. The fact of only one unmixed expository segment 
should be noted. The narrator rarely intrudes with a lengthy general philosophizing 
in this novel, thus affording only one clear segment of the genre in the samples. 
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However the quotation from it in example (6) could remind its reader of another 
voice earlier alluded to—Mr. Micawber! Ironically, the prose of Micawber and the 
prose of Dickens as expositor here share a significant characteristic—a low iterative 
density In fact the density of the expositor’s segment in Twist (13) is much lower than 
that of the Micawber segment from Copperfield (23), and probably owes this trait to 
the same facetiousness in choice of lexis, which the reader is expected to judge ironic 
in the case of the general narrator, but an eccentricity in the case of Micawber. 
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introduction. In this paper I discuss the results of a study of the role of the left 
and the right hemispheres in the verbal reasoning task of solving syllogisms. This 
study aimed to shed neurological light on whether educated individuals who have 
developed an ability to reason logically still preserve and use their pre-logical way of 
reasoning. It therefore investigated whether and how these two modes of reasoning— 
empirical’ (pre-logical, non-formal) and ‘theoretical’ (formal, logical)—are repre¬ 
sented neurologically. I argue that the results of this study require us to reconsider the 
accepted position that only the right hemisphere processes information holistically 
and to recognize that the left hemisphere is responsible for the recognition of inter- 
propositional linguistic wholes. 

1. prior studies in reasoning modes. To give an idea of the reasoning modes I cite 
an example of typical empirical and theoretical answers to syllogisms observed by 
Scribner (1977:485). 

The syllogism: 

(1) All people who own houses pay house tax. Boima does not pay a house tax. 
Does he own a house? 

can have in actual discourse two types of answers: a formally correct one, exemplify¬ 
ing a theoretical mode (signalled by the letter‘t’): 

(it) No, he does not, 

and a formally flawed one, exemplifying an empirical mode (signalled by the letter 
e): 


(le) Yes, he has a house. But he does not pay the tax, because he has no money. 

Though the first answer is logically correct, the second ‘incorrect’ one may be true 
to reality. 

A variety of disciplines—cross-cultural, developmental, educational—and schol¬ 
ars—psychologists, philosophers, semioticians—have addressed the existence of these 
two modes of reasoning (Dolinina in press). 
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Cross-cultural observations (Levy-Bruhl 1923, Levi-Strauss 1966), confirmed by 
cross-cultural experiments (Luria 1976/30’s; Cole, Gay, Glick & Sharp 1971; Cole & 
Scribner 1974; Scribner 1977; etc.), discovered that individuals in a‘traditional’ society 
can use only an empirical (‘pre-logical’) mode, whereas individuals in ‘advanced’ soci¬ 
eties can use a theoretical (‘logical’) mode. 

Two different explanations were offered as to why individuals from traditional 
societies do not develop a theoretical mode of reasoning. ‘Literacy theory’ connected 
appearance of the theoretical mode with literacy (Olson et al. 1985; Ong 1982). This 
explanation proved to be incorrect, since the theoretical mode is not used by tradi¬ 
tional cultures with local literacy—the case of the Vai discussed by Scribner and Cole 
(1981). Evidently, only western-type literacies, which go back to the Greek tradition of 
reasoning, have this effect on cognitive development. 

The ‘discourse theory’ explanation connected inability to use a theoretical mode 
with the absence in the traditional culture of such a ‘script’ as a syllogism. Discourse 
theory argues that the semantic resolution of a syllogistic script is fully dependent 
on its inner content and on the rules for relating the content of the premises with 
one another. It does not project the content against the realities of the outside world, 
which is actually the basis of the empirical strategy (see discussion in Scribner 1977, 
Olson et al. 1985, Ong 1982). Unlike ‘literacy’ theory, the discourse theory explanation 
looks highly plausible. But if it is correct it raises another question: do individuals 
who acquire the ability to work within a syllogistic script, that is within a theoretical 
mode, a) completely switch from empirical to formal reasoning, b) use both strategies 
(that is, manifest‘thought heterogeneity’), or c) still stick to the empirical mode? 

On this issue, cross-cultural and semiotic researchers (e.g., Lotman’s School of 
Semiotics in Tartu University—Lotman 1978, Ivanov 1978, etc.) suppose that both rea¬ 
soning strategies are utilized, but psychologists are not so unanimous. 

Thus, Piaget’s developmental research (Piaget 1964,1971,1975; Inhelder & Piaget 
1951) discusses the formation of reasoning abilities. Piaget claims that formal reason¬ 
ing appears quite late (between 11-15), in the last, fourth, stage of cognitive develop¬ 
ment, and requires formation not only of intra-propositional connections (between 
the predicate and its nominals) but also of inter-propositional connections (be¬ 
tween clauses: causal, inferential, etc.). In his early works Piaget claims that formal 
reasoning always develops, but in formulating his later theory of equilibration he 
comments that its development is socially triggered. But he does not specify whether 
the fourth stage always emerges, or what happens to empirical reasoning when formal 
reasoning is established. 

Contemporary cognitive psychologists are particularly interested in the ‘fate’ of 
the theoretical mode once it is established (Wason & Johnson-Laird 1972; Johnson- 
Laird 1983; Johnson-Laird & Byrne 1991; Moore 1986; Wilson & Sperber 1986). They 
raise two questions: 1) do individuals who acquire the theoretical mode actually use it 
instead of the empirical mode? and 2) is a theoretical mode, that is, formal reasoning, 
represented in the mind as a special mental component? Some cognitive psycholo¬ 
gists recognize the dominance of a theoretical mode once it is established and claim 
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that it is represented in the brain (e.g., Wilson & Sperber 1986), but others argue that 
people continue to use empirical modes of reasoning in normal-life situations and 
that the theoretical mode is not a special mental component. 

Thus, different approaches gave diverse answers as to the status of the two modes 
in human discourse and in the human brain. The neurological experiments discussed 
below aimed to answer this question from a new perspective. Considering the con¬ 
sensus that contemporary educated adults have potential access to both modes, they 
aimed to find out whether use and selection of a mode is somehow determined 1) by 
lateralized functions of the brain hemispheres or 2) by the type of information con¬ 
tained in the syllogism. 

2. DESIGN OF THE NEUROLOGICAL RESEARCH ON REASONING MODE PREFERENCES. The 

study was carried out in the Sechenov Institute of Evolutionary Physiology and Bio¬ 
chemistry, Russian Academy of Sciences, St. Petersburg, Russia, within the project of 
Functional Specialization of Brain Hemispheres. The data were collected from psy¬ 
chiatric patients (predominantly with depression) while they were recovering from 
transitory suppression of one of the hemispheres by electroconvulsive therapy. Spe¬ 
cial attention was paid to contributions of the right hemisphere to language, a most 
controversial issue even now. The project started in the early 1970s. The first report 
(Balonov & Deglin 1976) described the perception and production of sounds under 
temporary suppression by unilateral electroshocks of one hemisphere, leaving the 
other practically normal. Further research investigated syntax, morphology, lexicon, 
idioms and metaphors, and lastly syllogisms. The results were reported in a variety 
of publications in Russian, but very few in English and German and not in linguistic 
journals; the only linguistic response I am aware of is Jakobson 1995/1980. 

The research on syllogisms aimed 1) to test the abilities of literate adults to use 
both reasoning modes and 2) to discover the contribution of the left and right hemi¬ 
spheres to solving syllogisms. For this purpose each subject’s performance was tested 
in three states: before electroshock, and after transitory suppression of the left or right 
brain hemispheres (Cernigovskaja & Deglin 1990, Deglin 1995, Deglin & Kinsbourne 
1996). The question arises how subjects with a suppressed left hemisphere (including 
Wernickes area responsible for language processing) are able to handle syllogisms 
at all. The answer is twofold. First, the restitution of hemispheric functions after uni¬ 
lateral electroshocks, though rather quick (30-40 minutes), is gradual. This fact, and 
the results, suggest that subjects with left hemisphere suppressed were processing 
syllogisms when the left hemisphere had partially restored its functions and was 
able to process intraclausal connections, though not yet the interclausal ones needed 
for formal reasoning. Second, the studies of this group demonstrated that the right 
hemisphere has considerable involvement in language processing—it evaluates the 
semantic validity of linguistic information and verifies it against reality. So the 
right hemisphere may independently process the content of the propositions which 
constitute syllogisms. Thus, with minimal restoration of the syntactic function by 
the left hemisphere (considering the simplicity of the syntactic structures involved) 
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the individuals are able not only to understand, but also to produce, sentences, and 
consequently ‘quasi-syllogisms’, though at this stage they cannot process an inter- 
propositional complex which forms the basis of a real syllogism. 

The tested group included 14 (for Experiment 1) and 10 (for Experiment 2) right- 
handed individuals of both sexes, all with secondary and some with university educa¬ 
tion. The study tested solving of two types of syllogisms: 

a) Experiment 1: syllogisms with true premises (including syllogisms with both 
familiar and unfamiliar content similar to the ones in Luria’s experiments) 

b) Experiment 2: syllogisms with a false premise (a type of syllogisms was 
never tested before). 

3. RESULTS OF EXPERIMENT 1: SOLVING SYLLOGISMS WITH TRUE PREMISES. Here are 

some examples of syllogisms offered to the subjects: 

(2) There are fish in all the rivers where people put nets. People put nets on the 
Neva River. Are there fish in the Neva River, or not? 

(3) Every state has a flag. Zambia is a state. Has Zambia a flag, or not? 

Characteristics of answers to the syllogisms from the subjects in each of the three 
states: 

control state (both hemispheres active): In this state subjects gave predominantly 
theoretical answers (12 of 14). Such a result could be expected, since all the subjects 
were educated within a culture in which syllogisms are used. Only two subjects some¬ 
times gave empirical responses (in accordance with their experiences and/or beliefs) 
to some syllogisms, like the following in response to (2), which is classified as empirical 
because it is based on knowledge of realities, not just on the content of the premises: 

(2e) Everybody knows that there are smelt in the Neva. 

right hemisphere suppression (left hemisphere active): In this state the same 
number of individuals (2) gave some empirical answers. But as a whole there were 
strong differences from the control group. 

First, a more pronounced tendency for usage of a theoretical mode was observed. 
Second, the linguistic behavior of the subjects changed: it became more spontane¬ 
ous; all the tasks were solved more readily, without hesitation, and with much more 
assurance than in the control group. 

Third, in justifying their answers, the subjects referred spontaneously to the contents 
of the premises, and not to their worldly knowledge. Examples of such answers are: 
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(2t) Since it is said here that in all the rivers where people put nets there are fish, 
and the Neva is a river, and people put nets on it, so there are. 

(3t) Since each state has a flag, so Zambia also has one. 

left hemisphere suppression (right hemisphere active): In this case a strong dif¬ 
ference from both previous cases was observed. 

First, the number of empirical answers dramatically increased: 11 subjects (not just 
2) out of 14 used them. Some subjects gave only empirical answers without using the 
theoretical mode at all. In comparison with the control state, where only some syl¬ 
logisms, usually those like (2) or (3) with strongly familiar or strongly unfamiliar con¬ 
tent, were given empirical answers, here all syllogisms independently of the type of 
content were given typical empirical answers, such as: 

(2e') Of course there are, but inedible, they cannot be edible, there is much 
machine oil. Smelt should be. 

(3e) Who knows it, this Zambia? How can I know whether it has a flag or not? 

Second, though empirical answers prevailed, there were far fewer empirical 
answers to syllogisms with unfamiliar content than to syllogisms with familiar con¬ 
tent. In this respect these educated subjects behaved quite differently from Luria’s 
non-schooled subjects, who refused even to discuss syllogisms with unfamiliar con¬ 
tent, claiming that they cannot judge about anything they do not know. 

Third, the subjects’ readiness to use the modes differed from the control group, and 
was the reverse of the case of a suppressed right hemisphere: empirical answers were 
given quickly and with full assurance, whereas theoretical answers were given with 
difficulty and hesitations. 

Experiment 1 thus suggested that one and the same person who has access to 
both reasoning modes may solve one and the same task differently in different states. 
The type of answer depends mainly on which hemisphere is active, and also to some 
extent on the familiarity of the content of the premises. But overwhelmingly the left 
hemisphere uses a theoretical mode, the right hemisphere an empirical mode. So 
this experiment showed ‘that within our culture, under usual conditions the right- 
hemisphere mode of thought [empirical mode - I.D.] is not drawn to syllogism solv¬ 
ing’ (Deglin 1995: 23-24). Thus, in respect to syllogistic reasoning there is a strong 
lateralization of the functions of the hemispheres: this mode of reasoning is carried 
on by the left hemisphere. 

4. RESULTS OF EXPERIMENT 2: SOLVING SYLLOGISMS WITH FALSE PREMISES. This 

experiment aimed to find out how and whether the hemispheres are involved in iden¬ 
tifying the quality of information contained in the syllogisms. 

Examples of a syllogism with false premises (marked with*) used in the test: 
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(4) All monkeys climb trees."The hedgehog is a monkey Does the hedgehog 
climb trees, or not? 

(5) ’''Winter is cold in tropical countries. Ecuador is a tropical country. Is it cold 
in winter in Ecuador, or not? 

Observed responses were of four types: theoretical, empirical mode (two types) and 
‘ambivalent’. Typical theoretical responses followed the rules of making inferences, 
relying only on the inside-the-syllogism information, thus showing absence of criti¬ 
cism of the quality or reliability of information presented by the syllogism: 

(4t) Yes, it (hedgehog) does. 

(5t) Yes, winters are cold in Ecuador. 

Empirical responses were of two types. One type rejected the content of a premise: 

(4e) The hedgehog is not a monkey, 

(5e) It is warm in Ecuador! It is a tropical country! 

The other type, similar to that of Luria’s non-schooled subjects when they were 
offered syllogisms with unfamiliar content, represented a complete refusal to answer, 
justified by the absence of relevant personal experience (no outside-the-syllogism 
information): 

(4e') I have not seen hedgehogs, I do not know whether they climb trees or not. 

‘Ambivalent’ answers (signalled by the letter ‘a’) exhibit a difficulty in choosing 
between the modes. This type of answer was met only in the control state, never in a 
state of suppression of either hemisphere: 

(5a) Due to the structure of what is written, the answer should be positive, but I 
think winters cannot be cold there. 

General results of solving syllogisms with a false premise were the following: 

the control group gave all four types of responses. Predominantly (% of answers) 
empirical responses were used—rejection of the false premise or refusal to solve the 
syllogism. About Vi of the answers were theoretical (ignoring the premise’s mismatch 
with reality). The few ambivalent responses explicitly verbalized the subjects’ hesi¬ 
tancy about which strategy to use—to follow the rules of inference ignoring the false 
premise, or to pursue the truth: 
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(4a) Must I answer so as it is written here? Then the hedgehog climbs trees. But 
it does not climb. It is not a monkey. 

left hemisphere suppression (right active) was characterized by very strong 
rejection of false premises, that is, by using an empirical mode (slightly under 90%). 
The subjects refuted false premises with conviction and with a strong emotional reac¬ 
tion: extreme indignation, extended denials, etc: 

(5e') It is warm in Ecuador! Rubbish! It is a tropical country! It can’t be winter 
there, because it is in the South. Wrong what you write! 

right hemisphere suppression (left active) produced a dramatic change both 
from the control state and from the left hemisphere suppression state. The percent¬ 
age of theoretical answers grew to 75%, whereas that of empirical answers decreased 
to about 25%. Some individuals did not use empirical answers at all. The discourse 
behavior of the subjects who gave theoretical answers was strikingly confident: they 
did not pay any attention to the falsehood of the premises (completely relying on the 
authority of what is ‘said’ and ‘written’), and proceeded with the syllogism without 
even thinking about the quality of the information given to them. Besides, the emo¬ 
tional attitude to the task radically changed—the subjects performed calmly, neglect¬ 
ing the absurdity of the premises. The result was a variety of absurd conclusions, 
derived in accordance with correct rules of formal logic: 

(4t) The hedgehog climbs trees. 

(5t) It is cold in Ecuador. 

5. conclusions. This neurological research: 

1) showed that both theoretical and empirical modes are used by contemporary 
individuals and are simultaneously present in the brain of one and the same 
person. So the experiment confirmed the reality of‘thought heterogeneity’. 

2) suggested that both hemispheres participate in reasoning, but that reasoning 
functions are lateralized: the left hemisphere controls the theoretical mode, 
the right the empirical mode. 

3) established a certain tendency in selection of the modes: the empirical mode 
is preferred for syllogisms with familiar content, the theoretical mode for 
syllogisms with unfamiliar content. 

4) discovered a special role of the right hemisphere: it controls the quality of 
information in the syllogism, whereas the left hemisphere is indifferent to 
the content of the premises. 
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6. discussion. Combining these conclusions with the results of prior studies of rea¬ 
soning modes and of the role of the hemispheres in language processing allows us 
to discuss 1) the existence of a distinctive neurological substrate in contemporary 
schooled (‘bi-mode’) individuals for each mode, 2) the similarities and differences in 
using reasoning patterns between ‘mono-mode’ individuals on the one hand and ‘bi¬ 
mode’ individuals with a suppressed left hemisphere on the other, and 3) the ‘semantic 
responsibilities’ of the right and left hemispheres. 

1) The neurological experiments confirmed the observations of cross-cultural 
research that educated subjects as a rule use both reasoning strategies, and 
demonstrated that use of these strategies has a neurological substrate, since 
each hemisphere operates with a different mode. This, in its turn, suggests 
that each mode has a mental representation in the relevant hemisphere— 
the theoretical mode in the left hemisphere, the empirical mode in the 
right hemisphere. The lateralization of reasoning modes can be regarded as 
a counterevidence to Johnson-Laird’s claim that formal reasoning, unlike 
semantically based reasoning, is not represented in the mind. If his claim 
were correct, how could one explain the ability of‘left hemispheric’ individu¬ 
als (under the suppression of the right hemisphere and thus with blocking 
out the ability to utilize an empirical mode and, consequently, to actualize 
a semantic-based strategy) to reason, and to use exclusively a theoretical 
mode with full neglect of outside-the-syllogism realities? To be actualized 
(even despite semantic incongruencies with the realities of the world, as in 
syllogisms with false premises), this mode needs to be represented in the 
mind. Thus the step-by-step ‘grammar of inference’ must be a self-sufficient 
mental component. Thus formal rules of inference cannot be reduced to 
semantically adequate scenarios, as Johnson-Laird’s semantic model of rea¬ 
soning requires. 

2) The neurological experiments demonstrate that reasoning carried out by bi¬ 
mode individuals with only the right hemisphere active is similar to the 
reasoning of mono-mode individuals (those from ‘traditional’ societies and 
adolescents who have not yet developed the last stage of cognitive develop¬ 
ment and lack formation of inter-propositional connections) in that both 
groups reason in an empirical mode. This discovery gives further support to 
claims of semioticians (already supported by developmental and neurologi¬ 
cal studies) that the empirical mode (the function of the right hemisphere) is 
more archaic, and precedes the appearance of the theoretical mode. There is 
though, as these experiments revealed, a pronounced difference in how both 
groups deal with information outside of their personal knowledge: mono¬ 
modes completely refuse to make any inferences, whereas bi-modes have a 
tendency to use the theoretical mode (which supports the claims of some 
developmental models of reasoning that in acquiring new scientific know¬ 
ledge the theoretical mode is predominantly used). 
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One more point is important in the comparison of the two groups: their percep¬ 
tion of the syllogism. Cross-cultural experiments (Luria’s and later) discovered that 
the mono-modes were unable to repeat the syllogism, because of their inability to 
perceive it as one semantic unit, as a whole. Though this particular aspect of treating 
syllogisms was not directly addressed in this neurological experiment, the responses 
given made it obvious that the bi-modes with suppressed left hemisphere also failed 
to perceive the syllogism as a whole. 

3) Since it is universally accepted that the right hemisphere processes informa¬ 
tion in a holistic way, and the left hemisphere non-holistically, the inability 
of the right hemisphere to recognise the syllogism as a whole, and the con¬ 
trasting ability of the left hemisphere, forces us to modify our understand¬ 
ing of the concept of holicity. The results described here are congruent with 
the linguistic functions of the right hemisphere discovered in other experi¬ 
ments of the same group (since the early 1970s). These experiments showed 
that the right hemisphere plays a much more considerable role in process¬ 
ing semantic information than was earlier supposed, and is not restricted to 
pragmatic aspects or processing of polysemy. This implies that both hemi¬ 
spheres, not just the left, are involved in language processing. According to 
the ‘bi-hemispheric’ approach the right hemisphere is responsible linguis¬ 
tically for (among other things) the referential and semantic correctness 
of words with respect to real-world denotates (including first-order predi¬ 
cates), and the left hemisphere for their syntactic organization (Balonov, 
Deglin & Dolinina 1983) and also for all kinds of ‘connectors’, such as sub- 
ordinators, and also for second-order predicates, especially modal, phasal, 
etc. verbs. So both hemispheres have semantic responsibilities, but their 
scope is quite different: if the right hemisphere works with semantic ele¬ 
ments inside the proposition, the left hemisphere works with semantic 
elements outside propositions, either modifying them or putting them in 
some kind of mutual relations (causal, etc.). This understanding is fully sup¬ 
ported by the reasoning experiments. The right hemisphere appears to con¬ 
trol the quality of information within one proposition—e.g., the truthfulness 
(denotative accuracy) of premises (that is, units equivalent to a proposition), 
testing them against the realities of the world and/or personal knowledge/ 
experience. The left hemisphere is responsible for the correctness of opera¬ 
tional mechanisms (formal correctness of inferences), and also for semanti¬ 
cally combining together into a single unit a set of connected propositions 
(two premises and a conclusion-leading question). This means that the 
right hemisphere works holistically only with respect to a proposition as 
a unit (linguistically speaking, a first-order predicate with its arguments), 
whereas the left hemisphere works holistically with a set of rules of struc¬ 
tural organisation of several propositions (not going inside the proposition) 
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and perceives the new cluster as a unity. So both hemispheres carry on‘holis¬ 
tic’ functions, but their holicity is different in principle. 

This conclusion has to be reconciled with more general ideas of the functioning of 
the hemispheres: the right hemisphere is supposed to operate cognitively with unified 
configurations, whereas the left one is supposed to process discrete items (Witelson 
1987). Evidently the boundaries of the unified configurations, from the linguistic per¬ 
spective, are restricted to entities which can be perceived as a) a recognized unit of 
any linguistic level, independent of how many elements of the lower level constitute 
it and of what grammatical rules were involved in formation of this unit (the highest 
unit so far was considered to be a proposition actualising a single event/situation) and 
b) a content familiar as a whole to the speaker. So the ability to work holistically in 
linguistic terms is the ability to recognize some entity as a unity corresponding to one 
meaning, without going into the detail of what were the rules which created this unity 
(hence the ability of the right hemisphere to understand idioms and metaphors). The 
left hemisphere is primarily processing the rules of formation of more complex units 
from more simple ones, so its work is connected with assembling the modules. 

But the experiments with reasoning modes discovered that it is the right hemi¬ 
sphere which cannot perceive the syllogism holistically, whereas the left hemisphere 
can. How to explain this? Evidently, discourse theory provides the best explanatory 
framework. To be able to perceive such a complex structure as a syllogism consisting 
of a row of propositions as a unified entity (without being able to apply rules, since 
the right hemisphere cannot do that), there must be a prior familiarity with this script 
as a whole. But since this script does not exist (it is very problematic whether it can 
exist at all, considering the necessity of establishing an inter-propositional connec¬ 
tion), this set of propositions does not constitute a single unit. On the other hand, 
the left hemisphere can build such a script because it can assemble modules as large 
as clauses, and so can perceive the resulting entity as a unit. But the meaning of this 
unit is only inferential, not denotative. So if the syllogism constitutes a script with 
a content (as was assumed in the discourse theory of reasoning) it is a very special 
‘syntactic script’—a system of formal rules with the function of those rules, which is 
never tied to a definite content but only to a definite set of rules. In my opinion, the 
latter understanding of the syllogism is much more plausible. 
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introduction. In this paper, I will examine the status of a discourse marker that has 
been claimed to be polysemous. I will demonstrate that all uses of the Russian dis¬ 
course marker ze, traditionally labeled as ‘emphasis’, ‘justification’, contrast’ and even 
the diametrically opposing ‘similarity’ can, in fact, be reduced to a common denomi¬ 
nator. I argue that despite the apparent versatility of ze, this element realizes one dis¬ 
course function; namely, it instructs the addressee to process the information in a 
certain discourse segment vis-a-vis ‘hearer-old’ information. The unified account I 
adopt here will utilize the framework provided by Relevance Theory. 

1. previous treatments of ze: form and functions. This section will sketch the 
previous treatments of ze as a ‘second position’ clitic and as a ‘polysemous’ particle. 

1.1. form: second position. The Russian ze prototypically occurs after the first pro¬ 
sodic word, as in example (1), or constituent in the clause, as in example (2): 

(1) Cej ze den rozdenja #ze segodnja? 

Whose ze birthday #ze today 
‘So, whose birthday is it today?’ 

(2) Pocemu ty emu ne veris? On ze vrac. 

Why you him not believe? He ze doctor. 

‘Why don’t you believe him? After all, he is a doctor.’ 

Generative treatments have, thus, classified ze as a ‘second position clitic’, a phenom¬ 
enon that is commonly attested in almost all Slavic languages as well as in other lan¬ 
guages. Indeed, ze occurs in the second position in almost all instances of declaratives 
(example (3)), interrogatives (examples (4) and (5)) and imperatives (example (7)): 

(3) Masa ze ne znala ctoja dolzen priexaf. 

Masha ze not knew that I should come 

‘After all, Masha didn’t know that I should come.’ 

(4) Ty ze byl v Kitae? 

You ze was in China 

‘You have been to China, haven’t you?’ 


188 


ANNA FELDMAN 


(5) Kakoje ze plate #ze ty kupila? 

What ze dress #ze you bought 
‘What dress did you buy?’ 

(6) Otdai ze mnje moi den’gil 
Give-back ze me my money 
‘Give me my money back!’ 

The analysis of ze as a second position clitic apparently constitutes the correct 
descriptive characterization. Ze follows either the first constituent of a clause (e.g., (2), 
(3), and (4)) or the first prosodic word (e.g., (i), (5) and (6)). However, such a proposal 
of an essentially syntactic-phonological nature, regarding the clitic distribution, does 
not shed light on what factors trigger the first prosodic word to be a host and what 
factors trigger the first constituent to be a host. Thus, for example, it is unclear why 
(1) and (5) are infelicitous when ze occurs after the first syntactic constituent and not 
after the first prosodic word. The analysis of ze as a clitic that occupies the second 
syntactic/prosodic position in a clause is insufficient in many respects. First of all, 
there are cases where ze occurs later than the second position, as illustrated in the fol¬ 
lowing examples: 

(7) My dolzny tam byt’ segodnja ze vecerom 
We must there be today ze evening 
‘We have to be there tonight!’ 

(8) U menja est’ takaja ze kniga 
I have such ze book 

‘I have exactly the same book’. 

In (7) and (8), ze does not appear to occupy the second position, whichever way this 
position is defined. Thus, it is evident from these examples that the distribution of ze 
should be defined in a more refined way, otherwise it does not predict all the occur¬ 
rences of ze. 

Moreover, there are constructions that do not allow ze. One such example is dem¬ 
onstrated in (9): 

( 9 ) #Ty ze podpisyvaessja na ‘Pravdu ili ‘Jersualem Post’? 

You ze subscribe to ‘Pravda’ or ‘Jerusalem Post’ 

‘Do you subscribe to ‘Pravda or to ‘the Jerusalem Post’? 

(9) illustrates the impossibility of using ze in alternative questions. However, if the 
distribution of ze is restricted only to the first syntactic constituent or first prosodic 
word in clauses, the infelicity of this example remains unexplained. 

The aforementioned examples suggest that the characterizing ze as a second posi¬ 
tion clitic over-generates or under-generates the distributional properties of ze. Thus, 
constructions, such as (7) and (8), where ze apparently does not occupy the second 
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position, are under-generated; whereas cases, such as (5) or (9) are over-generated. 
Their distributional properties are not predicted by the ‘second position rule. More¬ 
over, adopting the analysis of ze as a second position clitic, raises the issue of what 
constitutes the first position. The ‘second position analysis in purely syntactic terms is 
not sufficient, as has been explicitly exemplified; and in fact, examination of a wider 
range of data suggests that this phenomenon may have to be linked to information 
structure in general and the concept of information unit in particular. It should be 
mentioned that analyses that have been proposed in an attempt to solve the problem 
of ‘the second position’, had to make a variety of ad hoc assumptions (e.g., Barbosas 
1996 and Hock’s 1996 ‘skipping’, Taylor’s 1996 ’prosodic inversion etc.). The syntactic- 
prosodic analysis of ze neither accounts for the distribution of this element nor pre¬ 
dicts all its occurrences. In section 4 ,1 will demonstrate how the occurrence of ze in 
positions other than second in the clause (as exemplified in (7) and (8)) can be incor¬ 
porated into a‘second position’ analysis. 

1.2. function: the traditional four kinds of ze. As has been mentioned, the dis¬ 
course marker ze is highly frequent in Russian. Traditional grammarians (e.g. Rozen- 
tal and Telenkova 1985 and others) treat ze as a ‘polysemous particle’ and characterize 
the range of meanings in terms such as ‘emphasis’, ‘contrast’ and even the completely 
opposite ‘similarity’, as well as providing examples of the type that I refer to as ‘justi¬ 
fication’. The range of possible meanings that were enumerated by traditional gram¬ 
marians is demonstrated in the following examples: 

a. contrast. Examples such as (10) and (11) are treated by traditional grammarians 
as contrastive. Accordingly, it is ze that contributes a so-called contrastive reading to 
these utterances. 

(10) On ostaetsja, ona ze uezzaet 

He stays, she ze leaves 

‘He is staying, [but] she, after all, is leaving 

(11) Masa umnaja, Dasa ze net 

Masha smart, Dasha ze not 

‘Masha is smart, [but] Dasha, after all, is not ’ 

b. jusTiFiCATiON._Another meaning of ze that I refer to as ‘justification’ is compatible 
with the English after all or Hebrew harey. 

(12) Cto ty stois? Sadis’v masinu. Ona ze nasa. 

What you stand? Get into car. It ze ours. 

‘What are you waiting for? Get into the car. It’s ours.’ 

(13) Potoropis’l On ze tebja zdet! 

Hurry up! He ze you wait 

‘Hurry up! He’s waiting for you, after all! 
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c. emphatic readings. There are certain uses of ze that traditional grammarians 
relate to as ‘emphatic’. What this term designates is unclear. In the following examples, 
ze is treated as having a so-called ‘emphatic’ reading. In these utterances, ze is analo¬ 
gous to the Hebrew kvar (‘already’) or nu whose function in certain contexts can be 
defined as ‘urging’ or ‘driving forward’. The following examples, according to the tra¬ 
ditional approach, display an emphatic function: 


(14) Kogda ze ty uberes v komnate? 

When ze you will clean in room 
When will you clean your room?’ 

(15) Skazi ze mne cto-nibud’! 

Tell ze me something 
‘Say something already!’ 

(16) My dolzny byt’ tam segodnja ze vecerom! 

We must be there today ze evening 
‘We have to be there tonight!’ 

(17) Cto ze mne delat’? 

What ze me to do 
What shall I do then?’ 

(18) Kto ze znal cto pogoda yizmenitsja? 

Who ze knew that weather change 

‘Who could have known that the weather would change? 

d. similarity. Sometimes, ze is described as conveying the ‘similarity’ meaning: 

(19) U menja est’ takaja ze kniga 
I have such ze book 

‘I have exactly the same book.’ 

(20) V tu ze minutu razdalsja zvonok 
At that ze moment rang bell 

‘At that very moment the bell rang’. 

The inadequacy of a polysemous account is evident from a comparison of instances 
occurring with and without ze. Thus, (2i)a and b exemplify the ‘contrastive’ reading: 

(21) a. On ostaetsja, ona ze uezzaet 

He stays, she ze leaves 
‘He is staying, [but] she, after all, is leaving, 
b. On ostaetsja, ona uezzaet 
He stays, she leaves 
‘He is staying, [but] she is leaving. 
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Clearly, the contrastive meaning is not contributed by ze itself. Even without the par¬ 
ticle ze, these examples are contrastive, because the contrast is present in the seman¬ 
tics of the utterances, as indicated in (2i)b. Therefore, the ‘contrastiveness’ cannot be 
characterized as an inherent property of this element. 

A similar claim can be made about the ‘justification’ and ‘similarity’ readings, as is 
demonstrated in (nja and b, and (23)a and b, respectively: 

(22) a. Cto ty stois? Sadis’ v masinu. Ona ze nasa. 

What you stand?Get into car. It ze our car. 

‘What are you waiting for? Get into the car. It’s ours.’ 
b. Cto ty stois? Sadis’ v masinu. Ona nasa. 

What you stand? Get into car. It ours. 

What are you waiting for? Get into the car. It’s ours.’ 

The justification meaning is preserved when ze is omitted, as in (23b). 

(23) a. U menja est’ takaja ze kniga 

I have such ze book 

‘I have exactly the same book.’ 
b. U menja est’ takaja kniga 

I have such book 

‘I have such a book.’ 

(23)b can be interpreted as displaying the ‘similarity’ meaning. Thus, like contrast’, 
‘justification and ‘similarity’ are not inherent properties of ze either. 

As for the term ‘emphatic’, it is problematic too, since it covers a variety of uses 
with no apparent common denominator as in (17) and (18) that cannot be clearly 
characterized in terms of emphasis. No parameters for defining the emphatic function 
are provided and the term is left loose and open-ended. As in the previous cases, the 
elimination of ze will result in no change of propositional content. 

(17) Cto ze mnje delat? 

What ze me to do 

‘What shall I do then?’ 

(18) Kto ze znal cto pogoda izmenitsja? 

Who ze knew that weather change 

‘Who could have known that the weather would change? 

In sum, the treatment of ze as a polysemous element that has a variety of totally 
unrelated uses has been shown to be inadequate in accounting for the range of data. 
The list of uses of ze is not exhaustive. As has been shown, some examples cannot be 
classified under any of the given labels (e.g. the so-called cases of emphasis). Likewise, 
the fact that totally arbitrary and completely opposed functions, such as ‘similarity’ 
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and contrast’, have been associated with ze suggests that a more insightful generaliza¬ 
tion concerning its meaning is required. 

As previously noted, in all of the given examples, the different readings are pres¬ 
ent in the propositional content of the utterance, with or without ze. Thus, ze does 
not contribute to the propositional content of the utterances. This leads to the pos¬ 
sibility that an analysis of ze should examine its contribution to the organization 
of the information structure, rather than to the semantics or propositional content of 
the utterance. Along these lines, a unified analysis of ze, accounting for the variety 
of uses, will be proposed within the framework of Relevance Theory. 

2. multi-functionality and common denominator. In this section, the Russian 
discourse marker ze is examined within the conceptual/procedural dichotomy pro¬ 
posed by Sperber and Wilson (1993) in the framework of Relevance Theory. The 
specific procedural function of ze is argued to be both relevance-motivated and struc¬ 
turally determined. The relevance-theoretic framework allows for a unitary account 
for the seemingly disparate uses of ze. 

2.1. ze in relevance-theoretic terms. As will be shown in this section, despite the 
apparent versatility of ze, this element realizes a unified discourse function; namely, 
it instructs the addressee to process the information contained in the utterance seg¬ 
ment vis-a-vis activated or re-activated information that occurs in the given linguistic 
or extra-linguistic context. 

Before presenting the Relevance-theoretic account of ze, the necessary details of 
the general theoretical framework will be outlined. 

2.1.1. relevance theory and conceptual/procedural dichotomy. Relevance 
Theory, proposed by Sperber and Wilson (1986), is based on the assumption that 
human understanding is guided by relevance considerations, which control the infer¬ 
ential processes involved in interpretation. Sperber and Wilson’s (1986) fundamental 
idea is that hearers are bound to interpret utterances as maximally relevant where 
relevance is regarded as a trade-off between contextual effects (i.e. informativity) and 
processing costs. The role of the context is to supply assumptions that are brought to 
bear in the assessment of relevance. This means that expressions that constrain the 
hearer’s choice of context must be analyzed as linguistic (or semantic) constraints on 
relevance. The speaker’s task is to create an expectation that the utterance will achieve 
enough contextual effects to be worth processing for the hearer and cause him no 
unnecessary processing effort (optimal relevance). The hearer’s task is to assess which 
effects the speaker could have intended to achieve, in other words, to constrain a con¬ 
text which will make the utterance worth processing. 

Sperber and Wilson (1986) proposed a distinction between conceptual and pro¬ 
cedural elements. Conceptual meaning encodes information that contributes to the 
propositional content and prototypically truth-conditionally relevant, whereas pro¬ 
cedural meaning encodes processing instructions, signaling to the addressee how to 
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process the utterance and usually does not contribute to the propositional content of 
utterances. 

Discourse markers contribute to relevance by operating as signals that tell the 
hearer how an utterance is to be understood, thus, reducing the processing efforts. 
Accordingly, the discourse marker ze is a procedural element, signaling to the 
addressee that the speaker takes certain information in the utterance to be hearer-old 
(in the sense of Prince (1981), Birner & Ward (1998)), and instructing the addressee to 
process the information contained in the utterance segment vis-a-vis activated or re¬ 
activated information. Prince (1981) offers a pair of crosscutting dichotomies which 
classify information as, on the one hand, either ‘discourse-old’ or ‘discourse-new’; and 
on the other hand, either ‘hearer-old’ or ‘hearer-new’. This distinction is essential in 
accounting for various distributional constraints, such as the so-called definiteness 
effect in existential sentences. Discourse-old information is information evoked in 
the current discourse; hearer-old information is the information that the speaker 
believes to be present within the hearer’s knowledge store. What is new to the dis¬ 
course is not necessarily new to the hearer. However, what is ‘discourse-old’ should 
be necessarily ‘hearer-old’. Constructions may vary with respect to whether they are 
sensitive to discourse-familiarity or hearer-familiarity. The element ze, thus, indicates 
to the addressee that the information contained in the current utterance or activated 
by it is expected to be ‘hearer-old’, thereby restricting the context used in processing. 

2.1.2. the unified account in relevance theory. In this section, the variety of 
instances of the discourse marker ze is analyzed. The utterances in (12) with and with¬ 
out ze as in the following have different contextual effects: 

(24) a. Cto ty stois? Sadis’v masinu. Ona ze nasa. 

What you stand? Get into car. It ze ours. 

‘What are you waiting for? Get into the car. It’s ours.’ 
b. Cto ty stois? Sadis’ v masinu. Ona nasa. 

What you stand? Get into car. It ours. 

‘What are you waiting for? Get into the car. It’s ours.’ 

In (24)b, the information that the car belongs to the speaker is not taken to be ‘hearer- 
old’. In this case, the speaker may merely be informing the addressee that this car 
belongs to him/her. A possible situation where (12)b can be uttered is a context where 
the speaker who bought a new Porsche without letting his wife know points to the car 
uttering Cto ty stois? Sadis' v masinu. Ona nasa. In this context the ze-utterance will 
be infelicitous, since the fact that the new Porsche belongs to the family is completely 
new to the hearer ( the wife). In (24)a with ze, on the other hand, the fact that 
the car is ‘ours’ is assumed by the speaker to be in the hearer’s store of knowledge. 
Such an utterance will be natural in the following context: looking for a car at the 
parking lot and eventually having found it , the husband says to his wife who is 
still not recognizing her property: Cto ty stois? Sadis' v masinu. Ona ze nasa. Ze 
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is only felicitous in a context where the speaker assumes that this information is 
already known or should be known to the hearer. In this case the information may 
be retrieved from long-term memory, from the situational context or the linguistic 
context. Consider ( 25 ) a and b: 

(25) a. Potoropis’! Oni ze tebja zdut! 

Hurry up! They ze you wait 
‘Hurry up! They are waiting for you, after all! 
b. Potoropis’! Oni tebja zdut! 

Hurry up! They you wait 

‘Hurry up! They are waiting for you! 

(25)a can occur without prior linguistic context. For example, in a theatre, the audi¬ 
ence is waiting for one of the actors, who is still busy with his make-up. The actor 
is likely to be told: ‘The audience ze is waiting for you!’ This context explicitly shows 
that the information within the utterance is given to the addressee or at the very least 
is assumed by the speaker to be given. The same linguistic sequence without ze will 
neutrally convey the information that they are waiting/ or the audience is waiting that 
might or might not be new to the hearer; however, when ze occurs in this utterance, 
the information must be ‘hearer-old’. All instances of ze can be analyzed along these 
lines. 

(10) is felicitous in a context where the hearer is supposed to know or to be able to 
infer that the person referred to is leaving. By using ze, the speaker intends to signal 
that the addressee is expected to have the relevant information in his store of knowl¬ 
edge. A possible context for (10) might be a situation like the following: 

A: Are they both leaving? B: He is staying. She ze is leaving. 

Such a context explicitly shows that the information that she is leaving is not new for 
the hearer. However, if the speaker chooses to inform the hearer that ‘he is staying, 
she is leaving’ without the previous assumption that the information is ‘hearer-old’, 
the utterance with ze will be infelicitous. 

(14)—(18) which were treated as indicating emphasis can be dealt with in a similar 
fashion. In all of them, ze is used to instruct the addressee to process the information 
contained in the utterance segment vis-a-vis activated or re-activated information 
that occurs in the given linguistic or extra-linguistic context. (14) is felicitous in a 
context like the following: 

A child was asked to clean his room. It is likely to happen that when his mother 

walks into the room and sees that the room is a mess, she will utter ‘When ze 

will you clean your room?’ 
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This context clearly demonstrates that ze can be used only when the information 
modified by this element is ‘hearer-old’. To the kid the information that he should 
have cleaned the room is not new. 

From the examples discussed so far, it is evident that the Russian entity ze func¬ 
tions as a unified signal to the addressee to process the information contained in 
the utterance segment in question vis-a-vis activated or re-activated information that 
occurs in the linguistic or extra-linguistic context. 

I will now address the ‘second position’ issue. 


2.1.3. form - 2nd position relative to scope. As has been mentioned earlier in 
the paper, in previous literature, ze was treated as a second position particle. In (7), 
(8), and (1), the position occupied by ze is neither the second prosodic word, nor the 
second syntactic position in the clause. Note that in (7), the Russian expression for 
‘tonight’ consists of two words ‘today evening’. A possible context for (7) might be the 
following: 


a. 

b. 


d. 


Masha: When are we going? 

Ivan: We are going tonight, (in Russian ‘today evening’) 

Masha: Let’s wait for Gosha and go tomorrow. 

Ivan: We have to be there tonight ze (in Russian ‘today ze evening’) 


In (d), ze modifies a domain smaller than the clause. Using ze, the speaker explicitly 
indicates that the information that they are going tonight may be retrieved from the 
context (in our case from the linguistic context, specified in b). The relevant infor¬ 
mation that should be reactivated in this case is not the whole clause ‘we are going 
tonight’, but only the constituent ‘today evening’. In order to express the retrievability 
from context of‘we are going tonight’, the speaker would use ze following ‘we’ in b: ‘We 
ze are going tonight’. 

To illustrate further that ze can have a domain smaller than the clause, consider 
context (8)' for the utterance in (8). 


(8)' a. A: I don’t believe it, I have such a book! 

b. B: I am not sure that this is the same book. 

c. A: No, I bought such ze a book, (i.e., I bought exactly the same book) 

In this example again, the information that should be retrieved from the hearer’s store 
of knowledge is not that, contained in the whole utterance in (8)'a, but the constitu¬ 
ent ‘such ze a book’. The event of ‘A’s buying a book’, expressed in the predication in 
(8)'c, might be unknown to the hearer, but the fact that A has exactly the same book 
is evident from the linguistic context. 

It should be mentioned that (8) can occur without prior linguistic context. In a 
context, where the speaker sees that a book, lying on the table, is exactly the same as 
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his, he is likely to utter (8). It is thus evident that the information, which is taken to be 
re-activated, need not be linguistically specified, but could be provided situationally. 

(1) is another example where ze seems to modify not the whole clause but only a 
single constituent—in this case again an NP. A possible context for (1) might be the 
following: During a birthday party, a person suddenly asks one of her friends: 

a. A: Whose birthday is it today? Mary’s? 

b. B: No. 

c. A: Peter’s? 

d. B: No. 

e. A: Whose ze birthday is it today? (The Russian variant of this utterance is 

given in (1)) 

The difference between the question without ze, in a, and its counterpart with ze, in 
e, is again in the contextual effects of these utterances. In the question without ze, the 
speaker has a presupposition that this is somebody’s birthday, because it is a birthday 
party. Such a presupposition is based on the knowledge of the world. In the case with 
ze, in addition to this presupposition, the speaker makes the assumption that there 
exists a restricted set of people whose birthday it might be and that this set in ques¬ 
tion is available to the hearer. The possible felicitous responses to this question are 
restricted. They have to include members of the relevant set. The question in (a), with¬ 
out ze, does not indicate that the hearer must choose one of the people from a given 
group. Thus, in addition to the regular effect of the wh-question, where the speaker 
assumes some set of information options to be given, ze restricts a given set to a given 
proper set. 

It appears that expressions, such as those in (1), (7) and (8), are not exceptions. 
They can be subsumed under the ‘second position’ analysis. As can be easily observed, 
the discourse marker ze occupies the second position in a relevant sub-segment. 

Such an effect is characteristic of other constructions, such as wh-clefts and con¬ 
ditionals. 

According to Prince (1978), a wh-cleft occurs coherently in a discourse only if 
the material inside the (subject) wh-clause represents material which a cooperative 
speaker can assume to be appropriate in the hearer’s consciousness at the time of 
hearing the utterance. In relevance-theoretic terms, adopted for our analysis, this 
material is already hearer-old information. When ze occurs in such constructions, it 
further restricts the context, i.e., this element signals that the information modified by 
ze is properly included in the given relevant set of information alternatives which is 
hearer-old. Consider the following example: 

(26) a. To ctoja xocu skazat’—sovsem ne to cto y y dumaete. 
what I want to say—completely not what you think 
‘What I want to say is completely different from what you think.’ 
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(26) b. To cto zeja xocu skazat'—sovsem ne to cto vy dumaete. 

what ze I want to say—completely not what you think 

‘What I want to say is completely different from what you think’. 

The difference between (26)a and b is in their contextual effects. (26)a indicates that 
the hearer knows the speaker is going to say something. In addition to this effect, ze 
in (26)b signals that the hearer not only knows that the speaker is going to say some¬ 
thing, but he knows the speaker could have said a number of things but he chose only 
one alternative from the relevant set. (26)b might occur in a context where the speaker 
utters: ‘You, of course, think that I am going to give you a lecture on morality, or maybe 
you think that I will read you a passage from the Bible, but what ze I want to say is 
completely different from what you think’. The speaker who uses a wh-cleft with ze 
instructs the hearer to infer that the choice of the particular set in question is available, 
even though the addressee does not know what exactly the speaker intends to say. 

Ze functions in a similar fashion in conditionals. Examine (27)a and b: 

(27) a. Yesli ja vypju sejcas kofe, ja ne budu spat noc'ju. 

If I drink now coffee, I will not sleep at night 
‘If I drink coffee now, I will not sleep at night.’ 
b. Yesli zeja vypju sejcas kofe, ja ne budu spat nocju. 

If ze I drink now coffee, I will not sleep at night 
‘If I drink coffee now, I will not sleep at night.’ 

The utterances given in (27) a and b differ from each other in that they might occur in 
distinct contexts. The condition in (27)a might or might not be new information to the 
addressee. However, (2j)b indicates that the hearer is supposed to infer that there exists 
a set of conditions, and that the condition in the utterance is one of the relevant alter¬ 
natives. The interaction of the function of ze with the general function of conditional 
sentences will result in an additional contextual effect. Thus, (22)b might appear in a 
context like the following: ‘If I drink hot milk now, I will fall asleep and will not be able 
to write my paper, but if ze I drink coffee now, I will stay awake for the entire night’. 

To sum up, as has been shown, ze instructs the addressee to process the informa¬ 
tion contained in the relevant utterance segment vis-a-vis activated or reactivated 
information that occurs in the linguistic or extra-linguistic context. As has been 
illustrated, ze is only felicitous in a context where the speaker assumes that relevant 
information is already known or could be available to the addressee from the previ¬ 
ous discourse or from the non-linguistic context (either situational or based on the 
knowledge of the world). 

I have drawn upon the traditional analysis of ze as a second position particle, extend¬ 
ing it to account for those cases in which ze does not seem to occupy the second posi¬ 
tion. I have shown that ze can take scope over constituents smaller than the clause, and 
in these cases, it occurs in the second position within this constituent. Thus, these appar¬ 
ently exceptional cases can be subsumed under the second position analysis. 
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In adopting the second position analysis for ze, the question that emerges concerns 
the nature of the first position. It has been shown that ze can refer to the whole utter¬ 
ance as well as to a relevant sub-segment. 

3. further predictions. The analysis proposed in this paper makes the correct pre¬ 
dictions with respect to such constructions as yes-no questions and alternative ques¬ 
tions. Thus, for instance, if ze occurs in a polar interrogative, it will interact with the 
general function of this kind of questions and the result will be a rhetorical question 
or a request for confirmation. 

(28) a. Ty ze podpisyvaessja na 'Jerusalem Post’? 

You ze subscribe to ‘Jerusalem Post 
‘You subscribe to ‘the Jerusalem Post, don’t you/ right?’ 
b. Ty podpisyvaessja na ‘Jerusalem Post’? 

You subscribe to ‘Jerusalem Post 

‘Do you subscribe to the Jerusalem Post?’ 

Ze indicates that the speaker assumes that the information modified by ze, i.e. that 
X subscribes to the Jerusalem Post is known. The utterance in (28)a can appear in a 
context like the following: 

Masha to Ivan: I have no idea who Putin is. Is he still working for the KGB? 

Ivan to Masha: You ze subscribe to ‘the Jerusalem Post’, don’t you? Why don’t you 
ever read it? 

The utterance in the context indicates that the information contained in the question 
is already taken by the speaker to be in the hearer’s store of knowledge. The basic 
function identified for ze can account for such cases as well. Ze instructs the addressee 
to process the information contained in the utterance segment vis-a-vis activated or 
re-activated information that occurs in the given linguistic or non-linguistic context 
(in this case, knowledge of the world). The interaction of the function of ze with the 
general function of yes-no questions will result in rhetorical questions or in request 
for confirmation, as demonstrated in (28)a. (28)b, which does not contain ze will 
occur in another context. For example, ‘I would like to see the latest article on Putin. 
Do you subscribe to the Jerusalem Post?’ In this context, the speaker does not convey 
an assumption that he already knows the hearer subscribes to the Jerusalem Post. 

Another prediction that can be made on the basis of the analysis proposed here is 
that ze will not occur in alternative questions, as illustrated in (29): 

(29) # Ty ze podpisyvaessja na ‘Pravdu’ili ‘Jerusalem Post’? 

You ze subscribe to ‘Pravda or ‘Jerusalem Post’ 

‘Do you subscribe to ‘Pravda or ‘the Jerusalem Post’? 
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Under the proposed analysis this fact follows naturally. The impossibility of using ze 
in example (29) is predictable. Ze will be infelicitous in a context where the speaker, 
uttering an alternative question, has no prior expectation as to which alternative will 
be chosen. The freedom to choose between two or more alternatives blocks the func¬ 
tion of ze as a marker of known/inferable information which should be processed 
vis-a-vis activated or re-activated information. Alternative questions contain mutu¬ 
ally exclusive information and therefore, will never co-occur with ze that takes scope 
over information known but not salient at the given point, as illustrated in (29). How¬ 
ever, this question becomes felicitous, if we take the sequence ‘Pravda or the Jerusa¬ 
lem Post’ in a context, where the addressee is not supposed to choose between two or 
more things; for example, if that speaker merely wants to make sure that the hearer 
indeed subscribes to one of these newspapers, no matter which one. In this case, then 
‘Pravda or the Jerusalem Post’ is taken as one item of information and the question 
is then not treated as alternative. The information that the hearer subscribes to one 
of these newspapers without necessarily specifying which one exactly is taken to be 
‘hearer-old’, and thus, this question functions as request for confirmation. 

4. conclusion. I have proposed a unified account of ze as a discourse marker that 
instructs the hearer to process the information contained in the utterance or in a rel¬ 
evant sub-segment vis-a-vis activated or re-activated information that occurs in the 
given linguistic or extra-linguistic context. The study of the Russian discourse marker 
ze indicates that adopting Relevance Theory considerations affecting discourse struc¬ 
ture with a distinction between conceptual and procedural uses of lexical items makes 
a coherent and unified characterization of this discourse marker possible. 

The previous analyses in syntactic-prosodic terms have been shown to be unable 
to explain the non-occurrence of ze in alternative questions as in (29). They also were 
incapable of accounting for the occurrence of ze in positions that in terms of linearity 
could not be described as second, as in (7) or (8). It has been shown that the scope 
of ze is insightfully accounted for by its position in utterances, defined in terms of 
linearity and information units. 

The fact that ze is given a unitary account within a theoretical framework that rec¬ 
onciles Relevance factors and structural consideration, raises a question for further 
investigation as to its applicability in cross-linguistic analyses of discourse markers 
in general. The question whether such an analysis indeed applies insightfully to other 
discourse markers and provides a coherent definition of discourse markers awaits 
further research. Since codification of information as ‘hearer-old’ is evident cross 
linguistically, and in most cases is codified lexically, comparative studies could thus 
increase our understanding of the division of labor between the various discourse 
markers in different languages. 


The term ‘justification does not appear in the traditional textbooks. The traditional 
descriptions provide examples of the type that I refer to as ‘justification’. Most dictionaries 
consider English after all or Hebrew harey as the best translations of ze. 
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2 I use the terminology proposed by Halpern (1992,1996), Inkelas (1989), Marantz (1988), 
Progovac (1996) and others, who differentiate two kinds of so-called ‘second position: 
after the first prosodic word or after the first syntactic constituent. This terminology is 
used for Serbo-Croatian, Polish, Greek and other languages. 

J It is difficult to translate utterances with ze into English. English cannot codify certain uses 
of ze in an exact way. 

1 In the following sections cases where ze does not appear second, whichever way this posi¬ 
tion is defined, will be included in the ‘second position analysis. 

> Note that in such examples the use of ze can be regarded as attitudinal, since it codifies 
some attitude conveyed by the speaker. 

6 Principle of Relevance: Every act of ostensive communication communicates a presump¬ 
tion of its own optimal relevance (Sperber & Wilson 1986). 

7 In the light of Sperber and Wilson (1993), studies utilizing the conceptual/procedural dis¬ 
tinction have indicated the need for some modification in this strict division and have 
claimed that while it is feasible to maintain the linguistic/non-linguistic division, this 
does not correspond to the separation between truth-conditional and non-truth-condi- 
tional aspects of propositions. 

s It should be mentioned that the codification of information as ‘hearer-old’ is evident 
cross-linguistically. Thus, for example, the Hebrew partial correlates of ze are nu, mamash, 
kvar, harey, etc., whereas the English partial correlates of the Russian discourse marker are 
after all, already. The following examples indicate the relevant ‘hearer-old’ codification: 

a. Why don’t you believe him? He zethareyIafter all is a doctor. 

b. Come ze\ / Come kvar!/ Nu, come!/ Come on already ! She is waiting for you! 

c. I have such ze a book./1 have exactly the same book/1 have mamash the same book. 

9 The effect of the interaction between the position of ze and various intonation clues is left 

for further research. 
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introduction. Nonverbal communication has received attention in many disci¬ 
plines. Apart from communication studies in the narrow sense, it forms one of the 
areas of interest in psycholinguistics, semiotics, language acquisition research, foreign 
language studies and rhetoric as well as in ethology, sociology and psychology 1 . It is 
striking that the majority of studies from these areas neglect the historical dimen¬ 
sions of nonverbal communication. There is of course no denying that reference is 
made to theoretical works of modern times such as John Bulwer’s Chirologia and 
Chironomia (1644), Johann Jakob Engel’s Ideen zu einer Mimik (1785) and Charles 
Darwins The expression of the emotions in man and animals (1872) 2 , but the fact that 
nonverbal communication was analysed in Greek and Roman antiquity from a more 
theoretical perspective has only been emphasized in the past few years, mainly with 
reference to the connection between rhetoric and gesture 3 . This is even more remark¬ 
able since there exists a number of archaeological investigations into the depiction of 
nonverbal communication on ancient vases and coins as well as in the form of sculp¬ 
tures 4 . It is the aim of my contribution to give a brief overview of the most important 
aspects of theoretical reflections on nonverbal communication in ancient writings. In 
doing so, I will attempt to highlight some vital perspectives for future research which 
I am currently pursuing in a more comprehensive project. 

1. ELEMENTS OF NONVERBAL COMMUNICATION AS HUMAN UNIVERSAL LANGUAGE. 

Nonverbal communication is referred to in ancient sources in a variety of contexts. 
Very often the authors in question write about situations in which people with dif¬ 
ferent mother tongues come into contact and find themselves in need of a mode of 
communication. The ancient sources mention various circumstances: Communica¬ 
tion functions unproblematically if one speaks the language of the other or if there 
are competent interpreters who can assist people in making themselves understood. 
In the Roman world, it is predominantly people from the educated upper-class who 
have a reliable knowledge of Greek, mainly because of their bilingual education in 
Latin and Greek 5 . Among the Greeks, however, other cultures and languages were 
pushed into the background to the extent that other peoples (and for a long time 
even the Romans) were regarded as ‘barbarians’ and their languages, accordingly, as 
barbarous idioms. Viewed like this, it is not surprising that the low degree of open- 
mindedness and receptivity for foreign languages among Greeks and Romans should 
often have resulted in communication problems with other peoples 6 . 
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That linguistic diversity and the lack of ability to communicate by means of spoken 
language may lead to alienation among people and that it can prevent more inten¬ 
sive contact with others right from the beginning, is a point mentioned by Pliny the 
Elder ( Naturalis historia 7.7) and, later, by Augustine. According to the latter, the feel¬ 
ing of being different from a person with another language might lead one to prefer 
the company of ones dog to that of a foreigner ( De civitate Dei 19.7 = PL 41.633-34 
Migne). Being forced into a position of non-communication on the grounds of lack¬ 
ing competence in foreign languages appears to have been a matter of embarrassment 
to the ancients. Thus, silence is best prevented by avoiding contact with foreigners 7 . 
In his Tusculans , Cicero underlines—interestingly starting from the Romans’ non¬ 
existent knowledge of Greek and the Greeks’ lack of familiarity with Latin—that not 
to understand a foreign language is tantamount to being deaf ( Tusc. 5.116). 

In Roman antiquity, the Latin language was valued as an integral part of the unity 
of the Roman empire. Connected with this was not only common ground on moral, 
political and religious levels but also the idea of a linguistically distinct society. This is 
pointed out by Cicero in a passage of his work De officiis (1.53), where he attributes to 
the mother tongue a high integrative value for its speakers. In addition to their politi¬ 
cal and national affiliation, the common language plays a vital role for the develop¬ 
ment of their identity; it generates a feeling of solidarity, a sense of a common bond 
among its speakers 8 . At the same time, we can deduce from passages such as Cicero’s 
that in his time there was already a well-developed awareness concerning linguistic 
diversity in the ancient world, even if on the whole the Romans did not pay much 
attention to foreign languages other than Greek. 

In this context, it is remarkable that ancient writers often point out that forms of 
nonverbal communication such as gesture and facial expression not only serve native 
speakers as visual support for the spoken word, but also that because of their vivid¬ 
ness and concreteness they can function as a kind of universal language. According to 
some authors, this is due to the fact that all human emotions have by nature a corre¬ 
sponding expression in face, voice and gesture—a hypothesis which is much disputed 
in modern research 9 . As Cicero notes: 

Atque in eis omnibus, quae sunt actionis, inest quaedam vis a natura data; qua 
re etiam hac imperiti, hac vulgus, hac denique barbari maxime commoventur: 
verba enim neminem movent nisi eum, qui eiusdem linguae societate coniunctus 
est, sententiaeque saepe acutae non acutorum hominum sensus praetervolant: actio, 
quae prae se motum animifert, omnis movet; isdem enim omnium animi motibus 
concitantur et eos isdem notis et in aliis agnoscunt et in se ipsi indicant. (Cicero, 

De oratore 3.223) 

‘And all the factors of delivery contain a certain force bestowed by nature; which moreover 
is the reason why it is delivery that has most effect on the ignorant and the mob and lastly 
on barbarians; for words influence nobody but the person allied to the speaker by sharing 
the same language, and clever ideas frequently outfly the understanding of people who are 
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not clever, whereas delivery, which gives the emotion of the mind expression, influences 
everybody, for the same emotions are felt by all people and they both recognize them in 
others and manifest them in themselves by the same symbols.’ 

In another passage Cicero compares elements of human nonverbal expression to a 
string instrument in terms of function and effect ( De oratore 3.216). For a linguisti¬ 
cally ill-prepared listener or a foreigner, gesture, facial expression and voice illustrate 
everything which the spoken word cannot express in a similarly clear way. These con¬ 
siderations lead Cicero to insist on the following requirement in particular for the 
professional orator: Even if an orator’s talent for verbal expression is highly devel¬ 
oped, he should not refrain from making skilful use of accompanying nonverbal ele¬ 
ments in order to convince the whole range of his multi-faceted audience. 

In a rhetorical context one has also to mention the treatment of nonverbal com¬ 
munication in Quintilian’s didactic treatise Institutio oratoria , but, as in the case of 
Cicero, they also transcend the boundaries of instruction on how to use facial expres¬ 
sion, gesture and voice. It is emphasized that the importance of these elements for 
rhetoric can be recognized from their effects in other areas. Quintilian argues that 
much information can be transmitted without words if one lets the body speak—it 
is mainly the hands and the face which serve as instruments of communication by 
mutes. Similarly, animals signal emotions such as wrath and joy by body language. 
A comparable practice is the expression of a state of mind by dancing (Inst. orat. 
11.3.65ff.; cf. also 11.3.87 with reference to the hands as the‘universal language’ of man), 
which is, of course, a topic of a treatise on the defence of this art (De saltatione ) 
written more than half a century later by the Greek author Lucian 10 . According to 
Lucian, the pantomimic dance portrays different characters and emotions (De salt. 
35ff, cf. also 67), especially figures from mythology (cf. the long list of themes in De 
salt. 37-61). Like the orator the dancer must attempt to achieve a maximum of clar¬ 
ity (aac(>f|veia) in his performance in order to make it comprehensible to his audi¬ 
ence (De salt. 62). Dancing is said to unite pleasure (to Tepuvov) and usefulness (to 
Xpf|aipov) in a unique way because it impresses the spectators not only by its harmo¬ 
nious movements but at the same time teaches them about the past (De salt. 71). A 
foreign guest of the Roman emperor Nero particularly admired an extremely talented 
pantomime dancer who lived at his court. While this foreigner did not understand 
the text of the song that accompanied the dance of the pantomime, it was the dancer’s 
highly illustrative performance which permitted him to follow the entire plot without 
difficulties. When Nero asked the guest what he wished as a farewell gift, the foreigner 
opted for the dancer who by his nonverbal art would serve him as an interpreter 
in negotiations with other peoples (De salt. 64). Athenaeus, another Greek writer, 
reports on another dancer called Memphis who explained Pythagorean philosophy 
more clearly by his dancing than others by the use of words (1.20 c-d). It seems quite 
likely that such statements go back to or are at least inspired by the treatment of the 
dance in Plato’s Laws (esp. 2.653 D9-654 A3, 2.672 d, 2.673 C11-D5,7.816 A3-7; cf. also 
7.814 Dio-815 D5). 
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That speaking with one’s hands is sometimes the only way to make oneself under¬ 
stood to foreigners was an experience which the poet Ovid had when he was ban¬ 
ished to Tomis, nowadays the Romanian seaside resort Constanta (on this more 
extensively Fogen 20oo:37ff). According to his own account, he communicated with 
the inhabitants of Tomis by gestures: 

exercent illi sociae commercia linguae: 

per gestum res est significanda mihi. 
barbarus hie ego sum, qui non intellegor ulli. (Tristia 5.10.35-37) 

‘They hold intercourse in the tongue they share; 

I must make myself understood by gestures. 

Here it is I that am a barbarian, understood by nobody.’ 

Admittedly these verses as well as the entire description of the Black Sea in Ovid’s 
two collections of poems Tristia and Epistulae ex Ponto have to be judged as an exag¬ 
geration based upon the literary topos of the locus horribilis. Distrust of Ovid’s por¬ 
trayal of Tomis is even more apposite as on other occasions he refers to his study of 
the Getic and Sarmatian language and even to poetic works in Getic from his own 
pen. Nonetheless, the passage cited contains a constant trait of the Roman view of the 
mother tongue and foreign languages. Here as in other sources, we find expressed an 
attitude which is still very widespread in the first century a.d.: in regarding Rome, 
the capital of the empire, as the centre of the world, many Romans looked down on 
everything non-Roman; this includes languages with the exception of Greek. 

Gesture and facial expression are similarly classified as universal language in 
the context of first-language acquisition. In his Confessions, Augustine conveys 
some highly informative insights concerning his own linguistic socialization ( Conf. 
1.6.7-1.19.30) 11 : As a baby ( in-fans: ‘speechless’), he tells us, he explained his wishes 
and desires to his environment by weeping, smiling, thrashing and screaming—by 
signs, as he adds himself, which corresponded to his wishes. However, such signs were 
not sufficiently unambiguous and sometimes caused misunderstandings among the 
adults around him; when they disobeyed him, little Augustine took revenge by crying 
( Conf. 1.6.8). The gift of the yet inarticulate infant to communicate his emotions non¬ 
verbally in these forms is viewed by Augustine as a peculiarity of man which has 
been granted by God ( Conf. 1.6.10-1.7.12). Next he reports how he acquired the verbal 
expressions for things: 

cum ipsi appellabant rem aliquam et cum secundum earn vocem corpus ad aliquid 
movebant, videbam et tenebam hoc ab eis vocari rem illam quod sonabant cum 
earn vellent ostendere. hoc autem eos velle ex motu corporis aperiebatur tamquam 
verbis naturalibus omnium gentium, quaefiunt vultu et nutu oculorum cetero- 
rumque membrorum actu et sonitu vocis indicante affectionem animi inpetendis, 
habendis, reiciendis fugiendisve rebus. (Conf 1.8.13) 
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‘When they named anything, and when at that name they moved their bodies toward that 
thing, I observed it and gathered thereby that that word which they then pronounced was 
the very name of the thing which they showed me. And that they meant this or that thing, 
was discovered to me by the motion of their bodies, even by that natural language, as it 
were, of all nations, which expressed by the countenance and cast of the eye, by the action 
of other parts, and the sound of the voice, discovers the affections of the mind, either to 
desire, enjoy, refuse, or to avoid anything.’ 

A similar point can be found in Lucretius’ theory of the beginnings of civilization, 
expounded in the fifth book of his didactic poem De rerum natura 12 . He is convinced 
that the invention of language was not the work of an individual because it is not 
plausible that this ability should have been given only to a single human being. More¬ 
over, how should such a ‘first inventor’ have taught his created expressions to others 
(De rer. nat. 5.1041-55)? Instead, one has to assume that in the beginning man uttered 
various sounds for different impressions and feelings and that, in doing so, he con¬ 
trasted the things he wished to denote (De rer. nat. 5.1056-90). In its initial stage, 
human language was comparable to the articulation of infants, who, to illustrate their 
messages, fall back upon gestures and point to things they refer to, using their fingers 
(De rer. nat. 5.1028-33, cf. already 5.1021-23). 

Despite many attempts in ancient literature at comprehending elements of non¬ 
verbal communication as a medium of international communication, the ancients 
never attempted to construct a verbal equivalent, an international auxiliary language 
or planned language like the modern Esperanto or Volapiik 13 . This fact is easily 
explained: On the one hand, Greek and, later, Latin have the status of world languages 
in the entire Mediterranean world, not only by the self-definition of their respective 
native speakers but as a matter of fact. On the other hand, it would have been com¬ 
pletely unthinkable for a Greek or a Roman to construct an artificial language as a 
replacement for a historically evolved idiom simply for the purpose of communication 
with other nations; it was the pronounced ethno- and linguocentrism that militated 
against such an idea right from the beginning 14 . Moreover, well-directed attempts at 
surmounting otherness and the striving for intercultural exchange enhanced by the 
learning of foreign languages are alien to ancient thinking, even if the idea of cosmo¬ 
politism and the equality of men found its supporters in some philosophical schools, 
notably the Stoics and Cynics 15 . Still, it has to be borne in mind that, at least for the 
Roman world, the naive opinion that the structure of the mother tongue is given by 
nature and the only one which is ‘correct’—a view which equates ‘mother tongue’ and 
‘language’—was not disseminated in the literary sources we have. This is not least 
a result of the Greco-Latin bilingualism in the Roman upper-class which had been 
increasingly noticeable since the second century b.c. 

2. gesture, facial expression and voice in rhetoric. Theoretical aspects of non¬ 
verbal communication are most extensively dealt with by ancient authors in rhetori¬ 
cal treatises as prescriptive and normative instructions for the orator to enable him to 
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be as persuasive as possible over and above what is achieved by linguistic and stylistic 
means. It is evident that, among the orators five tasks, gesture, facial expression and 
voice quality form part of delivery (uTtoKpioic; resp. actio or pronuntiatio). The impor¬ 
tant role delivery played among the orator’s tasks ( officia oratoris ) can be gathered 
from an anecdote about Demosthenes which is handed down by many Greek and 
Roman authors: He is said to have attributed first, second and third rank to delivery 16 , 
and in this he is followed by later theoreticians (see e.g., Rhet. Her. 3.19; Cicero, De 
orat. 3.213). Most Greek writings on gesture, facial expression and voice quality have 
unfortunately been lost 17 so that in the majority of cases we have to content ourselves 
with reconstructing Greek theory from Roman sources, not all of which have sur¬ 
vived either 18 . In the following, I shall concentrate on passages from four Latin works, 
singling out no more than the most important aspects: the anonymous Rhetorica 
ad Herennium (3.19-27), Cicero’s Orator (54-60) and De oratore (esp. 3.213-27), and 
Quintilian’s Institutio oratoria (esp. 11.3). All these writers divide delivery into voice 
quality (vox) on the one hand, and gesture ( gestus ) and facial expression ( vultus ) on 
the other. Accordingly, delivery was termed either pronuntiatio with emphasis on the 
vocal aspects or actio with reference to the use of body language (Rhet. Her. 3.19; 
Cicero, Orator 55 andDe orat. 3.213-27; Quintilian, Inst. orat. 11.3.1,11.3.14). 

With the exception of Quintilian, the writers surveyed pay much more attention to 
the use of the voice than to the use of the body. Since by the different modulations of 
his voice the orator evokes certain emotions in his audience, he has to adjust his voice 
to the contents of his speech and to the reactions he wants to elicit from his listeners. 
This intention requires the chosen pitch to be in harmony with the orator’s character 
(particularly remarkable Rhet. Her. 3.22). The state of mind the voice evokes in the 
audience ought to be reinforced by corresponding gestures and facial expressions of 
the orator. The principle of appropriateness ( aptum ) is equally vital for the use of ele¬ 
ments of nonverbal communication as it is on the verbal level, where the choice 
of words, the stylistic levels and registers must be selected in accordance with the 
topic of the orator’s speech, his intention, the make-up of his audience and other 
external circumstances, but also with his own personal make-up 19 . Elements of ges¬ 
ture and facial expression should fit the orator’s individual personality and at the 
same time match the requirements of the oratorical profession, whose representatives 
are expected to be concerned about their dignity ( dignitas ) and authority (gravitas). 
For that reason, the orator should not behave like an actor on stage, even if dramatic 
art may sometimes provide the orator with ideas for the shaping of his speeches 
especially at the nonverbal level (Rhet. Her. 3.26; Cicero, De orat. 3.220; Quintilian, 
Inst. orat. 11.3.103,123,125, 181-84, esp. 184 on the situation in Quintilian’s era). In 
connection with this, Quintilian recommends a restriction to gestures which accom¬ 
pany speech in a natural way (‘illustrators’ in the terminology of Ekman and Friesen) 
and suggests that gestures that substitute the spoken word (‘emblems’ in Ekman’s and 
Friesen’s terms; cf. 1969: 63-70, and 1979: niff., 118) should be avoided because they 
belong to the domain of actors in the theatre (Inst. orat. 11.3.88-91). The reason for 
such a demand is apparent, if we consider the social stratification of Roman society. 
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Actors and pantomimes usually belonged to the lower stratum; their profession was 
associated with ill repute, a lack of moral restraint and even indecency which at least 
officially conflicted with the strict morality of respectable Roman citizens (cf. Wiist 
1949: esp. 860-63; Rotolo 1957: 49k, 63-67). 

In rhetorical writings, the significance of facial expression for the orator and in 
particular the expression of the eyes are firmly emphasized. For example, Cicero 
remarks 20 : 

Sed in ore sunt omnia, in eo autem ipsi dominatus est omnis oculorum; ...animi 
est enim omnis actio et imago animi vultus, indices oculi: nam haec est unapars 
corporis, quae, quot animi motus sunt, tot significationes et commutationes possit 
efficere; neque vero est quisquam, qui eadem conivens efficiat. (De orat. 3.221) 

‘But everything depends on the face, while the face itself is entirely dominated by the 
eyes... For delivery is wholly the concern of the feelings, and these are mirrored by the 
face and expressed by the eyes; for this is the only part of the body capable of producing 
as many indications and variations as there are emotions, and there is nobody who can 
produce the same effect with his eyes shut.’ 

As regards gestures, Quintilian believes that the hands are most important, since with 
them the orator can transmit messages in a way that is nearly as sophisticated as the 
use of words. In denoting places and persons, the hands take on, as it were, the func¬ 
tion of adverbs and pronouns. Since with them one can describe an incredible range 
of facts, Quintilian thinks that it is no exaggeration to understand gestural commu¬ 
nication which makes use of the hands as a language common to all human beings 
(Inst. orat. 11.3.85-87). 

3. conclusion. I do not want to conclude my overview without mentioning that I 
have presented a very selective treatment of the subject. Among the ancient treatises 
on rhetoric, for example, particularly the eleventh book of Quintilians Institutio ora- 
toria would be worth a separate analysis (cf. as a first start Maier-Eichhorn 1989), 
since it is the only extant comprehensive treatment of the use of gestures in speeches. 
In addition to such large-scale treatises, there are numerous ancient texts which con¬ 
tain some valuable information on the more theoretical aspects of the use of gestures 
and facial expression. Let me just refer to the famous passage in Plato’s Cratylus (esp. 
422 Dii-423 B7) or to the Younger Senecas prose writings. 

But even without this, I hope to have demonstrated that the theorizing on non¬ 
verbal communication by ancient authors starts from various perspectives. The basis 
of these reflections are, first, hypotheses on the origin of language; second, consider¬ 
ations about the first-language acquisition of the child; third, remarks on the problem 
of foreign languages and, connected with this, the view of gesture and facial expres¬ 
sion as a universal language of all human beings; and fourth, the elaboration of rhe¬ 
torical doctrines for the purpose of a skilful use of nonverbal elements in public 
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speech. It is evident that whatever perspective the different writings may take, they 
also rely upon sign theory and discuss the status of signs. A systematic compilation 
of theoretical reflections on nonverbal communication in ancient Greek and Roman 
authors is still a desideratum; it is the aim of my latest research project to fill this gap 
in a more extensive study. 


On the interdisciplinary framework of scholarship on nonverbal communication e.g., 
Noll (1998:156). 

On the history of theorizing on gesture see the most recent overview in Muller (1998: 
25-85); the author includes antiquity, in particular Quintilian. 

The most exhaustive study is still Sittl (1890), most recent monographs by Aldrete (1999) 
and Boegehold (1999). Shorter accounts can be found in Wohrle (1990), Bremmer (1991), 
Graf (1991), and Lateiner ( 3 i996). The following studies are concentrated on single ancient 
authors: Lateiner (1987), Lateiner (1995), Maier-Eichhorn (1989), and Lobe (2000). Only 
very briefly on antiquity the article by Kosenina (1996: esp. 565-67). 

See in particular Deonna (1914), Zschietzschmann (1924), Jucker (1956), Brilliant (1963), 
Neumann (1965), Bogen (1969), and Fehr (1979). 

Regarding Rome on the political level as the model for everything else, the Romans were, 
on the whole, not very open-minded about anything non-Roman. A knowledge of foreign 
languages extending beyond Greek is not very often a topic of Latin writers; sometimes 
the ability to manage one foreign language is already regarded as a remarkable achieve¬ 
ment. Important sources are e.g., Quintilian, Inst. orat. 11.2.50: Ceterum quantum natura 
studioque valeat memoria, vel Themistocles testis, quem unum intra annum optime locutum 
esse Persice constat, vel Mithridates, cui duas et viginti linguas, quot nationibus imperabat, 
traditur notasfuisse, vel Crassus ille dives, qui cum Asiaepraeesset, quinque Graeci sermonis 
differentias sic tenuit, ut, qua quisque apud eum lingua postulasset, eadem ius sibi redditum 
ferret... (‘For the rest there are many historical examples of the power to which memory 
may be developed by natural aptitude and application. Themistocles is said to have spoken 
excellently in Persian after a year’s study; Mithridates is recorded to have known twenty-two 
languages, that being the number of the different nations included in his empire; Crassus, 
surnamed the Rich, when commanding in Asia had such a complete mastery of five differ¬ 
ent Greek dialects, that he would give judgement in the dialect employed by the plaintiff in 
putting forward his suit’); on Themistocles cf. already Thucydides 1.138 and Nepos, Them. 
10.1, on Mithridates similarly Pliny Nat. hist. 7.88, 25.6, and Gellius, Noct. Att. 17.17.2, and 
on P. Licinius Crassus Dives cf. Valerius Maximus 8.7.6. Also relevant is Gellius, Noct. Att. 
17.17.1: Quintus Ennius tria corda habere sese dicebat, quod loqui Graece et Osce et Latine 
sciret. (‘Quintus Ennius used to say that he had three hearts, because he knew how to speak 
Greek, Oscan, and Latin.’) 

On the problem of foreign languages in Greco-Roman antiquity see Fogen (2000: esp. 
31-60), with all relevant secondary literature (see also pp. 242-47). 

Cf. also Weinrich (1985:198): ‘Wahrscheinlich ist die Rat- und Hilflosigkeit gegeniiber 
einem anderen Menschen, mit dem man sich nicht verstandigen kann, eine menschliche 
Grund- und Grenzerfahrung, bei der Andersheit und Fremdheit zusammenfallen, wenig- 
stens auf den ersten Blick.’ 
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8 The importance of the societas patrii sermonis is similarly emphasized in Anonymus, Rhet. 
Her. 4.37: nihil Fregellanis morum et sermonis societas opitulata est (‘of no avail, to the Fre- 
gellans was fellowship with us in customs and in language’), and in Cicero, Verr. 2.5.167 
(taken up by Aulus Gellius, Noct. Att. 1.7.2): ...apud cives... Romanos, qui et sermonis et 
iuris et multarum rerum societate iuncti sunt (‘...among Roman citizens, who are united 
by the common bond of language, rights, and many interests’), and later on Charisius, GL 
1.50.15-16 Keil (= 62.2-4 Barwick): Latinus vero sermo cum ipso homine civitatis suae 
natus significandis intellegundisque quae diceret praestitit (‘But the Latin language, come 
into existence at the same time as the Roman citizen, succeeded in expressing its message by 
conveying meaning and having it understood’). That such a view was not restricted to the 
Romans, can be gathered from Curtius Rufus, Hist. Alex. Mag. 6.10.23, who lets the Mace¬ 
donian Philotas point out: Mihi quidem obicitur quod societatem patrii sermonis asperner, 
quod Macedonum mores fastidiam (‘It is even charged against me that I scorn association 
with my native language, that I disdain the customs of the Macedonians’). Cf. also the vice- 
versa relation in Quintilian, Inst. orat. 8.1.3: ...si fieri potest, et verba omnia et vox huius 
alumnum urbis oleant, ut oratio Rotnanaplane videatur, non civitate donata (‘...if possible, 
our voice and all our words should be such as to reveal the native of this city, so that our 
speech may seem to be of genuine Roman origin, and not merely to have been presented 
with Roman citizenship.’). Language as a vital power, which re-enforces the sense of com¬ 
munity, in the following case of the Athenians as belonging to the larger body of the 
Hellenes, is—apart from common religious customs and origin (ties of blood)—already 
mentioned in Herodotus 8.144 (cf- also 2.56f.). For the later period see Dionysius Halicar¬ 
nassus, Antiqu. Rom. 1.89.3. 

9 An excellent summary of the different positions from universalism to relativism can 
be found in Wallbott (1979) and Wallbott (1993); see also the short remarks in Clyne 
(1975:162-63). Supporters of a moderate universalism include Ekman (1970) and Ekman 
and Friesen (1979:111-12,118). The universalist hypothesis was first and foremost rejected 
by Birdwhistell (1970, esp. 34). 

10 On the problem of the authorship and date see Robertson (1913): the work was written 
between 162 and 165 during Lucian’s stay in Antiochia; with it, the author wanted to 
win over the emperor Verus, who was enthusiastic about pantomimes (Script. Hist. Aug., 
Verus 8 and 10-11.). This view is shared by Robert (1930, esp. 120-22); in contrast Bier 
(1917) suggests a date between Domitian and Marcus Aurelius. That the work is not by 
Lucian is assumed by Helm (1906:3690. On parallels in Aristides and Libanius cf. Mesk 
(1908:59-74). A useful study on the ancient pantomime dance in general is, apart from Bier 
(1917), Rotolo (1957) who also collects relevant epigraphic and epigrammatic sources. 

11 See the commentaries by O’Donnell (1992) and Clark (1995). Also important is Marrou 
(1981). 

12 On language origin in Epicurus ( Ep. ad Herod. 75-76) and Lucretius see the following 
studies: de Lacy (1939), Long (1971), Offermann (1972), Schrijvers (1974), Pigeaud (1983), 
Ferguson (1987), Pfligersdorffer (1988), and Hossenfelder (1991). 

13 Cf. Schwyzer (1902) and Rochette (1995:10-11). 

14 Cf. also Pei (1958:83): ‘Universality was implicit in the Graeco-Roman world. To its mem¬ 
bers that world constituted the universe, despite the vague awareness of a world beyond, 
...The Roman Empire, once constituted, became a powerful, living reality whose limes was 
almost as much of a boundary to the ancients as the stratosphere is to us.’ 

On this see Baldry (1965) and Lana (1951). 
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16 Cicero, Brutus 142, Orator 56, De orat. 3.213; Valerius Maximus 8.10 ext. 1; Quintilian, 
Inst. orat. 11.3.6-7; Philodemus, Rhet. 1.196 Sudhaus; Longinus, Rhet. Gr. 1.2.195 Spengel/ 
Hammer; Plutarch, Mor. 845B. See also Dionysius Halicarnassus’ remarks on Demos¬ 
thenes’ intensive voice and body exercises in Dem. 53. 

17 Brief treatments of this are Maier-Eichhorn (1989:7-14) and Katsouris (1989:26-33). On 
Theophrastus see Fortenbaugh (1985). The non-existence of more extensive writings on 
actio is mentioned in Rhet. Her. 3.19 (cf. also 3.27 fin.). 

18 E.g., Quintilian, Inst. orat. 11.3.143, refers to writings on gesture by Plotius Gallus and 
Nigidius Figulus. 

19 Cf. Fogen (1999:5-9), similarly Fogen (2000:119-22). In addition, see Quintilian, Inst. orat. 
11.1,11.3.61-65,11.3.150-53, and further 11.3.177-84, esp. 180. 

20 Similarly Quintilian, Inst. orat. 11.3.75-77, esp. 75: sed in ipso vultu plurimum valent oculi, 
per quos maxime animus emanat, ut citra motum quoque et hilaritate enitescant et tristi- 
tiae quoddam nubilum ducant. quin etiarn lacrimas iis natura mentis indices dedit, quae 
aut erumpunt dolore aut laetitia manant. motu vero intenti, remissi, superbi, torvi, mites, 
asperi fiunt: quae, ut actus poposcerit,fingentur. (‘But of the various elements that go to form 
the expression, the eyes are the most important, since they, more than anything else, reveal 
the temper of the mind, and without actual movement will twinkle with merriment or be 
clouded with grief. And further, nature has given them tears to serve as interpreters of our 
feelings, tears that will break forth for sorrow or stream for very joy. But, when the eyes move, 
they become intent, indifferent, proud, fierce, mild, or angry; and they will assume all these 
characters according as the pleading may demand.’) 
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THE GRAMMAR OF THE PICTISH SYMBOL STONES 
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the picts inhabited most of Scotland in the first millennium of the Common Era. 
While we know from such evidence as place names (Nicolaisen 1972,1996) that they 
spoke a Brythonic Celtic language, aside from these place names, the king lists (see 
Cummins 1995), and a handful of ogham inscriptions that have so far eluded decod¬ 
ing (but see Cummins 1999:60-68), nothing else is known of their language (compare 
Forsyth 1996,1997a). 

1. the PICTISH symbol stones. The most extensive and most identifiably Pictish arti¬ 
facts that these people left behind are the symbols that they carved on a number of 
stones and other articles. These display a remarkable degree of consistency and are 
reproduced in typical forms in Table 1 (overleaf, from the compilation of Sutherland 
1997, who draws her figures from Allen & Anderson 1903). 

Where the interests of linguistics enter the picture is in the enigmatic combi¬ 
nations of these symbols throughout Pictland. Usually in pairs, one over another, 
these combinations strongly suggest the communication of some form of informa¬ 
tion adhering to a set of rules. As Thomas points out: 

Statements containing two symbols, or Pairs, do not appear to conform to any 
random distribution (this could almost certainly be demonstrated mathemati¬ 
cally by a simple matrix analysis) and it may be assumed that selective rules are 
at work. There is a noticeable emphasis on the use of Object, rather than Animal, 
symbols. Repetition of complete pairs exists, and among the actual symbols used, 
both Animal and Object, certain symbols occur disproportionately more often 
than others. (Thomas 1963:37) 

Such observations indicate to the linguist that some form of rule-based grammar 
is involved; and one mathematical matrix has been worked out by Jackson (1984), 
although with an interpretation that is not widely accepted (Forsyth 199713:85). Of 
course, before we attempt to ascertain what they might have meant, we must first 
determine the level of grammar on which this information was being conveyed. 

1.1 the semantic level. The first (and perhaps the most obvious) level suggested 
for the grammar of the Pictish symbol stones is the semantic, or more precisely the 
lexico-semantic. In this approach, each symbol relates to some precise designation 
that maintains a particular meaning. 
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Table 1. Pictish symbols. 


Thomas (1963) interprets the symbols as funerary. Since there are not enough such 
symbols to represent individuals or indeed individual names, he suggests that they 
‘must perforce do this in terms of... status and... group-affiliation (Thomas 1963:87). 
Moreover, he posits quasi-grammatical rules to account for their distribution, consid¬ 
ering them to be personal nouns and adjectives. 

Henderson (1971), while accepting the semantic and even the grammatical basis 
for the symbols, suggests that they may have referred rather to the ownership of 
land. Along somewhat similar lines, Jackson (1984) expands upon the grammatical 
basis and presents an impressive hypothesis that the paired symbols represent mar¬ 
riages within the Pictish matrilineal system. Unfortunately, the hypothesis suffers 
from some rather serious problems (Driscoll 1986). 
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More recently, Samson (1992) proffers the hypothesis that the symbols alone reveal 
nothing—only the symbols in combination have significance. Each combination sup¬ 
posedly refers to an individual memorialized on the stones. Indeed, such pairing of 
thematic elements does reflect the construction of names used in contemporary Irish 
and Anglo-Saxon, and a statistical analysis of such elements appears to support the 
hypothesis. The drawback to this approach, however, lies in the fact that these symbols 
do occasionally occur singly; nor are they restricted to funerary contexts, but are even 
found ‘on a number of portable objects, including a silver pin and silver plaques from 
Nome’s Law, Fife, and the terminal of the silver chain from Whitecleuch, Lanarkshire’ 
(Ritchie & Ritchie 1981:164; see also 172-74). 

1.2 the orthographic level. One significant breakthrough in Samson’s argument 
is that it takes the debate off of the semantic level per se and onto a morphological 
one. As noted by Forsyth, ‘for the first time... the symbols represented language rather 
than ideas’ (199711:87). This suggests to her that the symbols might reside on a level of 
writing—an orthographic level. 

In examining how such a system might work, however, Forsyth has to acknowl¬ 
edge serious problems for each type of known orthographic system. First of all, a 
logographic system would not work because there are simply not enough symbols. 
While the number of symbols would certainly support a syllabary, Pictish words and 
names were not restricted to two syllables each. Likewise, although the number of 
symbols could support an alphabet, the ‘brevity of the extant texts ... all but precludes 
an alphabetic interpretation for Pictish symbols (unless one is prepared to accept 
the unlikely scenario that the symbol statements represent pairs of initials)’ (Forsyth 

I997b:93). 

2. an appeal to context. The greatest hindrance to a linguistic interpretation of the 
symbol stones has been a lack of appropriate context. Harkening back to linguistics 
in the early 20th century, we recall that in order for communication to be compre¬ 
hended and analyzed effectively, we first need to know the context of situation.’ As 
noted by Firth, 

The basic assumption of the theory of analysis by levels is that any text can be 
regarded as a constituent of a context of situation or of a series of such contexts, 
and thus attested in experience, since the categories of the abstract context of 
situation will comprise both verbal and non-verbal constituents and, in renewal 
of connection, should be related to an observable and justifiable grouped set of 
events in the run of experience. (Firth 1968:175) 

Indeed, there has been some recent evidence that may shed light upon the intended 
context of situation for these symbols, and hence upon the appropriate level of gram¬ 
mar for analysis. 
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Figure 1. Meigle 10. 

2.1 the cult of the archer guardian. While a full explanation of the context for 
the Cult of the Archer Guardian is not possible in this forum, a brief summary of 
some of the evidence may suffice. In a recumbent stone known as Meigle 10 (follow¬ 
ing the enumeration of Allen and Anderson 1903), we find a situation depicted by 
Chalmers (1848) as in Figure 1 (from Ritchie 1997:121). 

As explained with detailed justification elsewhere (see Griffen 2000), the figures 
being transported in the formal processional vehicle—a carpentum (see Laing & Laing 
1984:277-78)—are souls undergoing the translation to the otherworld according 
to widely held beliefs in the Pagan Celtic religion (compare, for example, Green 
1986:121-37). The scene in the lower right shows the alternative fate of the soul—its 
capture by what we may term the Decapitating Beast through the removal of the 
persons head. In the Celtic tradition, the head is the home of the soul, and both 
the role of the head and its decapitation are also well attested in the Celtic tradition 
(Ross 1996:94-95). In between these two scenes, we see the Archer Guardian, who 
protects the otherworld and those being translated from the ravages of the Decapitat¬ 
ing Beast. 

In the Christianization of Pictland, the Archer Guardian was taken over as the 
one who guards heaven for the elect, at least according to the doctrine of predestina¬ 
tion, which was in fact prevalent in Britain at the time (see Hardinge 1972:61). We 
find rather clear evidence of this syncretism in the Ruthwell Cross in Figure 2 (detail 
from Meyvaert 1992: figure 4), in which the Archer Guardian is shown in the lower 
panel of the cross, metaphorically between heaven and hell. The Celtic sacred eagle, 
appropriately enough, is in the upper panel, for its ‘vast wing-span and powerful, high 
flight epitomize the huge span of the sky’ (Green 1986:188). 

2.2 the v-rod and the z-ROD. When we examine the symbols in table 1, we see that 
there are two types that stand out—those with the so-called V-rod and Z-rod (repro¬ 
duced at the beginning of the table for ease of reference). The V-rod is quite clearly an 
arrow that has been bent to an angle of about 90° and placed over a crescent. The cres¬ 
cent is widely regarded as a symbol of death (compare Sutherland 1994:103-107), and 
the arrow is bent in precisely the manner that one would bend a sword or other such 
votive offering before depositing it—a ritual ‘killing’ of the object (compare Green 
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Figure 2. The Ruthwell cross. 

1 995 : 47 °- 7 1 > Maier 1997:241). This very widespread combination—the only one with 
a V-rod—is thus quite consistent with the cult of the Archer Guardian and probably 
represents a symbolic invocation of the Archer perhaps upon the death of an indi¬ 
vidual. 

The Z-rod occurs superimposed upon only three symbols (the second through the 
fourth symbols, respectively, in table 1): The ‘notched rectangle’ is widely regarded as a 
chariot, and as such it is completely consistent with the carpentum in which the soul 
is translated to the otherworld or to heaven (compare Thomas 1963:52-53); the snake 
is a particularly holy creature in the Pagan Celtic religion because it travels between 
this world and the otherworld (in its underworld aspect—Sutherland 1994:89-90); 
and the double-disk symbol is a double-sun representation that in its broadest inter¬ 
pretation can certainly represent this world and the otherworld and the path in 
between (compare Sutherland 1994:107-108). 

When we examine the Z-rod in these contexts, we find that they are all very 
much consistent with the cult of the Archer Guardian. The rod itself is an arrow (or 
spear) bent twice so as to define the boundary between this world and the other- 
world—the boundary traversed by the carpentum and the snake, the boundary on 
the path between the two suns. Moreover, the bends allow it to take up its flight in 
the same direction on this side as it began on the other side. It can thus be seen as 
representing the frontier defended by the Archer Guardian. 

2.3 other symbols. Once we establish the religious nature of these four symbols, 
other Pictish symbols fall into place with known Celtic religious beliefs. For example, 
the eagle (or any bird) was regarded as holy because it spoke a language of the other- 
world and could also fly to and from the otherworld. Moreover, as noted above, the 
eagle’s vast wing-span represented the sky itself. Other symbols will be treated further 
below in section 3.2. 

Thus, we can readily surmise that the context of these symbols is intended to 
be religious. Moreover, it is consistent with the general religious beliefs for which 
we have evidence from throughout the Celtic world (as we would expect—compare 
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Laing & Laing 1993:21). With the context of situation thus defined, we can now turn 
our attention to the issue of the grammar of the symbols. 

3. the grammar of the symbols. As noted above, the symbols are usually written 
together in pairs, although single symbols and three or even four symbols may also be 
found (beyond this, we are probably working with groups of two or three). Accord¬ 
ingly, the level of grammar should allow for one unit in isolation, for two units as 
the preferred configuration, and for an extension to three or four units. Furthermore, 
where the V-rod is used, the unit must be consistent with the invocation of the Archer 
Guardian (or of God in the syncretic period) on behalf of a deceased—or in some 
other dedicatory context, as this interpretation need not be restricted to funerary situ¬ 
ations (no more than a cross on a necklace must correspond to one over a grave). 

3.1 the clause as asymbolic unit. Quite important to the level of grammar, the unit 
must also be appropriate to a religious context; and where the V-rod is used, it must 
be appropriate to what one may be expected to say in an invocation to the Archer 
Guardian, especially (though not necessarily) where a death has occurred. The only 
Pagan Celtic invocation extant occurs at the beginning of the Tablet of Chamalieres: 

I beseech the very divine, the divine Maponos Avernatis by means of the magic 
tablet: quicken(?) us, i.e. those (named below) [sic] by the magic of the under¬ 
world spirits!?). (Koch 1995:3) 

Of course, in the process of syncretism. Paganism adopted rituals and concepts 
from Christianity‘to supply its own deficiencies’ (De Reu 1998:27), especially in mat¬ 
ters of liturgy. The closest type of prayer in the Christian tradition to an invocation of 
the Archer Guardian would be the collect. We find just such a prayer in the introduc¬ 
tory collect of the Torrha (or Stowe) Missal: 

O God, Who to Blessed Peter Thine Apostle didst bestow by the keys of the 
Heavenly Kingdom, the power to bind and loosen souls, and didst give the office 
of High Priest, receive our prayers of propitiation and his intercession. We ask 
O Lord for help that we may be freed from the bonds of our sins through our 
Lord Jesus Christ Who reigneth with Thee and the Holy Spirit, God throughout 
all ages of ages. (Dowling 1995) 

When we reduce the collect to its essential formula, we find that it always expresses 
the construction: 

(may) god [who...] do this 

Thus, we have two clauses, a petitionary clause with an embedded relative clause 
modifying the subject being invoked at the beginning of the prayer. The relative 
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clause identifies the attributes of God most appropriate to the type of petition. This 
opening complex sentence may then be followed by further, related petitions, creating 
in effect a compound complex sentence. 

Such a formula is furthermore quite consistent with Pagan thinking, for we are 
naming God and asking God to fulfill our petition. The Pagans ‘believed in the gods 
because of the help they offered: they delivered prosperity, strength, a plentiful har¬ 
vest, good fishing and victory in battle (De Reu 1998:13-14). Indeed, the Pagan could 
make a petition directly to the intended divine being, without the awkward require¬ 
ment of praying only to God while invoking the intercession of a saint. 

What is of greatest importance to the linguist, of course, is the fact that we can 
identify the context of situation as originally a Pagan prayer invoking the Archer 
Guardian and making a petition. From the internal linguistic, grammatical perspec¬ 
tive, moreover, we can further identify the formula as consisting of clauses in a com¬ 
plex or compound complex sentence. Thus, our level of linguistic analysis for the 
Pictish symbol stones is the clause structure. 

This would seem reasonable, for the clause can stand on its own, providing inde¬ 
pendent information in the appropriate context—an invocation without petition or a 
petition without invocation. The usual form for a collect, however, is the main clause 
with the petition and the relative clause with the attributes—thus accounting for the 
majority of Pictish representations with two symbols/clauses. Where only two sym¬ 
bols occur, they are probably in the form of a general petition, asking for a blessing or 
protection in accordance with the first sentence of the Lorrha Missal’s collect above. 

Moreover, in the collect from the Lorrha Missal, we go beyond the general peti¬ 
tion, introducing specific petitions appealing to the attributes described in the rela¬ 
tive clause. If the Pictish symbol stones do indeed represent Pagan prayers parallel to 
such collects, this extension of the petition from general to specific would account 
for the additional symbols. In these cases then, we end up with one or more further 
symbols/clauses. 

3.2. interpreting the symbol stones. Thus far, we have the religious context of 
situation, transparent meanings for the symbols directly connected with the Cult of 
the Archer Guardian, fairly transparent meanings for some of the other symbols, and 
the level of grammar consisting of clauses used in invocation and petition. 

At this point, then, we can suggest meanings for some of the more transparent 
combinations. Since these combinations appear on stones often with a funerary 
intent, we can supply a tentative structure, following that of the collect above, such as 
the following: 

may the archer guardian, who..., 

protect the departed upon their journey 

Thus, the relative clause appropriate for the Z-rod and double-disk may be ‘who 
guards the border between this world and the otherworld,’ that for the arch may be 
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Double-disk 
and Z-rod 

who guards the border between this world and 
the otherworld 

Arch 

who guards the entrance to the otherworld 

Notched rectangle 

who guards the vehicle of translation 

Notched rectangle 
and Z-rod 

who guards the vehicle of translation in its 
journey to the otherworld 

Crescent 

who watches over the dead/over archers 

Pictish Beast 

? 

Cauldron 

who enables resurrection in the otherworld 

Snake and Z-rod 

who guards the passage to the otherworld 

Rectangle 

? 

Double-disk 

who guards the passage between this world and 
the otherworld 

Circle 

who ensures life everlasting 


Table 2. Possible interpretations of comginations with the crescent and V-rod. 

‘who guards the entrance to the otherworld,’ and so forth. Some possible (and highly 
tentative) interpretations based upon the broader Celtic symbolism are provided in 
Table 2. 

Where the crescent and V-rod come second, we would have to assume (following 
the formula for the collect) that the prayer is being made to the first element. For 
example, the combination of the eagle with the crescent and Z-rod may be interpreted 
as an invocation of the eagle, whose wingspan covers the earth and encompasses the 
Archer Guardian. Of course, some petitions do not involve the Archer Guardian at all. 

4. conclusion. Without a context of situation, the identification of the appropriate 
level of linguistic analysis for the Pictish symbol stones is quite frankly impossible. 
While previous researchers may have been able to rule out certain grammatical levels, 
they have had no way of determining the level itself. Indeed, if one had suggested pre¬ 
vious to the identification of the Cult of the Archer Guardian that the interpretation 
be religious and at the clause level, one could have supported that conclusion with no 
greater facility than any other. 

As modern linguists, we tend to take the context of situation for granted, for our 
areas of inquiry have become rather limited to oral and written communication with 
precisely known or‘neutral’ context. It is when we go beyond the elicitation of familiar 
and comprehended verbal language that we see how important knowing the exact 
context, the exact intent and the exact style of communication really is. 
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introduction. A major issue in the study of discourse is how narrative is struc¬ 
tured. Numerous studies have already provided good analyses of various aspects of 
its structure. Longacre (1983:3), for instance, pointed out two main features of narra¬ 
tive discourse. One is what he characterized as the ‘contingent temporal succession of 
the described actions or events in a narrative. This characterization means that these 
actions or events are usually not only temporally ordered but also mutually depen¬ 
dent. Such dependency may be created by causality or it may result from a more gen¬ 
eral sense of coherence. The other main feature of narrative discourse, according to 
Longacre, is its ‘agent orientation. This orientation reflects the fact that narratives are 
typically built around human agents or other animate beings. 

It has also been established that narrative is not composed merely of sequences of 
sentences, but is hierarchically organized into intermediate units such as episodes or 
paragraphs (Bamberg & Marchman 1991; Black & Bower 1979; van Dijk 1982; Grimes 
1975; Hinds 1977,1979; Longacre 1979; Pu, Prideaux & Stanford 1992; Tomlin 1987). 
This fact is generally in line with results from studies that are more focused on the 
issues of how thematic continuity is maintained and how thematic discontinuities are 
signaled in the narrative type of discourse (Brown & Yule 1983; van Dijk & Kintsch 
1983; Givon 1983,1984; Virtanen 1992). 

1. towards an experimental study. Based on all the research mentioned above, the 
present study carried out an empirical investigation of language users’ conception of 
episode transition and its role in the organization of narrative discourse. More spe¬ 
cifically, the present study seeks to determine what the important elements are that 
form the basis of our conception of episode transitions and how such transitions are 
linguistically expressed. 

Empirical support for the validity of episode has been provided by psycholinguistic 
studies which indicate that people recalling stories treat information of an episode as 
an integral unit (cf. Black & Bower 1979; Glenn 1978; Mandler 1978; Thorndyke 1977), 
a result which is further supported by studies that have found the episode-shift effect, 
i.e., readers pause longer in processing episode-initial sentences that introduce an 
episode shift than episode-internal sentences (Haberlandt, Berian & Sandson 1980; 
Mandler & Goodman 1982). Although an episode may not have its boundary marked 
as the indented paragraph, these studies suggest that the boundary of an episode is 
recognizable on thematic grounds. 
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Equally important contributions to our understanding of the structure of narra¬ 
tive have been made by studies using other methodologies. Based on his study of The 
Pear Story data, Chafe (1979,1980) not only demonstrated the relationship between 
major thematic breaks and processing difficulty by using hesitation as evidence, he 
also pointed out that such thematic breaks tend to be associated with a change in 
time, place, character configuration, or event structure. Because the focus of his study 
was to show the relationship between the difficulty of oral production and points of 
thematic shifts, he did not discuss in a comprehensive way the forms of linguistic 
expressions that are used at these points of shift. 

Focusing their attention on the discourse function of preposed subordinate clause 
of time in their experimental research, Prideaux & Hogan (1993) found that in both 
oral and written narratives, the preposed subordinate clause of time (a marked struc¬ 
ture) occurred statistically far more frequently at the beginning of an episode than 
the nonpreposed one (an unmarked structure). The nonpreposed subordinate clause 
of time tended to occur far more frequently within an episode. Their analysis indicates 
that because the preposed subordinate clause is a good device for thematic reorienta¬ 
tion, it is often used to code the beginning of a new discourse unit. Givon (1993:315) 
made the same observation as Prideaux and Hogan (1993) and, in addition he also 
suggested that preposed participial phrases and preposed prepositional phrases may 
serve the same discourse function. 

2 . identifying episode transitions: a segmentation task. As the survey indi¬ 
cated above, the previous studies have, for various reasons, suggested a correlation 
between episode transitions and major changes of such discourse elements as time, 
place, or participant. However, a study is needed to show the empirical situation of 
this correlation based on episode boundaries as identified by ordinary language users. 
If we can acquire some empirical evidence about ordinary language users’ view of 
how narrative develops from one episode to another, we will better understand how 
narrative is structured. The following is the hypothesis for the present study: 

In narrative discourse, episode transitions are primarily defined by major changes 
in time, place, or participant. When the contents of a narrative lack such changes 
as anchorable points of transition, the episode boundary will become less clear. 

To test this hypothesis, an experiment was conducted in which 20 native speakers of 
English and 20 native speakers of Chinese were asked to segment a few narrative texts 
in their own languages. The narrative texts used for both language groups were personal 
accounts of past experience. The three English narrative texts used as stimuli were all 
from The Reader’s Digest. The four Chinese narrative texts were from two major liter¬ 
ary magazines published in China. Each text was given to the participants in the experi¬ 
ment in an unparagraphed form but with a title. The participants were asked to divide 
the texts into episodes by placing a vertical stroke before the first word of the episode. 
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An episode was defined to the participants as a portion of a narrative that relates to an 
event or a series of connected events and forms a coherent unit in itself. 

3. RESULTS AND ANALYSES. 

3.1. sentences in the English high-rate group. After the segmented narratives 
were collected, all the identified sentences were tabulated and those identified by 70% 
or more of the participants were picked out and analyzed as representative episode-ini¬ 
tial sentences because 70% stands for a clear majority. These sentences are categorized 
as the high-rate group. Excerpts 1,2, and 3 provide some examples from this group: 

(1) a. Now, hearing of her elephantine raspberries, I went immediately to Ellies quar¬ 

ters. She looked at me with moist, gray eyes and flapped her ears. “Praaa,” went the 
trunk, (followed by the narrator’s temporary diagnosis and decision) 

b. Back at my office, I phoned a surgical-instrument manufacturer. I asked to borrow 
a flexible endoscope, (followed by an explanation of the use of the instrument and 
the result of getting it) 

c. The next morning I injected Ellie with anesthetic. Five minutes later she sank to 
her knees, unconscious, her trunk still blaring when she exhaled, (followed by the 
narrator’s examination of the inside of Elbe’s trunk) 

(2) a. Before attempting to scale the heights in the real world, I had gone to Chelsea 

Piers, a sports complex in Manhattan, to train on their climbing wall. For fifty 
bucks, I spent two hours learning to tie the classic climbing knots and use the 
equipment, (followed by an account of how the narrator became motivated in 
climbing a real mountain) 

b. Forty feet up a novice climb called Jackie, 1 got stuck between a crack and an 
overhang. I was sure that if I moved either of my feet or my paws, gravity would 
have its way with me. (followed by a description of the narrator’s difficult situation 
and her further actions of climbing) 

c. After I conquered Jackie, Paul took me over to Three Pines, an easier multiple- 
pitch climb—which means climbing cliffs that are higher than your rope is long, 
(followed by an account of the climbing process) 

(3) a. On Monday morning Ellie came to see me again. And again on Tuesday, Wednes¬ 

day, Thursday and Friday. Each time I looked at the trunk and told her how well 
she was doing. On the morning of her tenth office visit, I snipped out the steel 
sutures. The wound had healed and would leave hardly any scar. Ellie must have 
decided that the removal of the stitches indicated that I was signing her off, for she 
didn’t come the next day, nor ever again. 

b. A fellow vet once told me that his father had been prescribed champagne after an 
operation for nasal polyps. “It sounds like sensible medicine,” he had remarked. I 
agreed. So a few days later, I bought a bottle to split in celebration with Ellies keeper. 
The patient, however, did not go unrewarded. Ellie got a large, iced currant bun. 



230 


SHAOJUN JI 


According to the hypothesis of this study, if episode transitions are characterized 
by major changes in time, place, or participant, the episode-initial sentences should 
contain information about these changes. So, an important part of the analysis of 
these sentences is to examine them one by one and find out if they are characterized 
by such changes. Within each episode-initial sentence the sentence-initial structures 
are the focus of our attention. From the point of view of textual development, the 
initial position of the transitional sentence is crucial in the sense that it is easy for 
the element placed here to connect the sentence with what has preceded in the previ¬ 
ous thematic unit. It is also a natural starting point for the rest of the sentence or the 
whole unit it leads. 

Table 1 summarizes the correspondences between sentence-initial structures in 
the English high-rate group and the three types of discontinuities: 

In tabulating the changes in time, place, or participant, we set up three functional 
categories, namely temporal, spatial, and participant discontinuities. There are alto¬ 
gether 14 sentences in the high-rate group. Table 1 records the correspondences between 
the sentence-initial structures and certain discontinuities in 13 sentences. Because one 
of them has two initial structures. Table 1 actually contains 14 tokens of structure. 

A case of temporal or spatial discontinuity corresponds to a sentence-initial struc¬ 
ture indicating a clear change in time or place. For instance, now in (i)a is recorded as 
adverb expressing a temporal discontinuity, whereas back at my office in (i)b is treated 
as a case of prepositional phrase indicating a spatial discontinuity. The next morning 
in (i)c is regarded as a noun phrase informing a temporal discontinuity. If a sentence 
starts with a new participant in its subject position, it is treated as a case of participant 
discontinuity. A fellow vet in (3)b is an instance of such a case. 

A case of temporal discontinuity may also be a sentence-initial participial phrase, a 
prepositional phrase, or a subordinate clause such as hearing of her elephantine rasp¬ 
berries in (i)a, before attempting to scale the heights in the real world in (2)a, and after 
I conquered Jackie in (2)c. What these structures have in common is the indication 
of a particular time by referring to a specific event or activity in the previous epi¬ 
sode. According to Givon (1993:315), all these sentence-initial structures have their 
coherence links in two directions, anaphoric and cataphoric. Their anaphoric links 
can reach back to thematic information anywhere in the preceding episode. Their 
cataphoric links anchor themselves nicely to the main clause, which then launches 
the new episode. (One transitional sentence in the English high-rate group had to be 
analyzed on its own.) 

As recorded in Table 1, the initial structures in all the English high-rate group sen¬ 
tences but one are found to carry the information indicating major changes in time, 
place, or participant. A very prominent feature about the structures expressing these 
changes is that except in the case of participant discontinuity in Table 1, which is 
expressed by the indefinite noun phrase in the subject position (A fellow vet once 
told me...), all the other structures informing changes in time or place are preposed 
and therefore ‘marked’ structures. These preposed structures are regarded as marked 
because the canonical positions for these structures in English sentences are at the 
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Syntactic Categories 

Functional Categories 

Temporal 

Discontinuity 

Spatial 

Discontinuity 

Participant 

Discontinuity 

Adverb 

2 



Prepositional Phrase 

3 

2 


Participial Phrase 

1 



Noun Phrase 

3 

1 

1 

Subordinate Clause 

1 



Total 

10 

3 

1 


Table 1. Sentence-initial structures expressing different types of discontinuities (based 
on sentences of the English high-rate group: 13/14=93%). 


end of the sentence (e.g., Clark & Clark 1977; Givon 1983; Quirk et al. 1985). The con¬ 
sistent use of these marked structures constitutes a very prominent feature about 
the way temporal and spatial discontinuities are expressed in the episode-initial sen¬ 
tences. This observation supports Prideaux & Hogans (1993) finding about the fre¬ 
quent use of the marked subordinate clause of time at episode-initial position. It 
is also compatible with Virtanen’s (1992) data of clause-initial adverbials used at 
the beginning of textual units. Following Givon (1993), Prideaux & Hogan (1993), I 
believe that the consistent use of marked structures at episode-initial position is func¬ 
tionally motivated. This functional motivation is to signal the start of a new episode 
by referring to the changes in time and place. 

3.2. sentences in the Chinese high-rate group. The segmented Chinese texts 
obtained from Experiment 1 were processed in the same way as their English counter¬ 
parts. The episode-initial sentences in the Chinese high-rate group may be exempli¬ 
fied by those occurring in the two excerpts below: 

(4) a. shengcheng xia lai de nei ge zhiqing kaishi 

provincial capital descend come nom 1 that cl educated youth initially 

ting bu dong houlai mingbai le juede youqu jiu yixiao. 

listen not understand afterwards understand pfv feel funny just laugh 

‘The educated youth that came from the provincial capital did not understand 
(the term) in the first place. Understanding it later he thought it was funny and 
laughed about it.’ (followed by an account of how the young man differed from the 
villagers in speech and for that reason was laughed at by them) 

b. ruogan man hou, shengcheng xia lai de nei ge zhiqing 

several years later provincial capital descend come nom that cl educated youth 

bujin tingdong le xiangcun de fangyan liyu, erqie po you yanjiu. 

not only understand pfv countryside assoc dialect slang but also quite have research 

‘Several years later, the educated youth that came from the provincial capital not 
only understood the dialect and slang in the rural area but actually knew quite 
a lot about them.’ (followed by an account of how he got the nickname Wutian 
from the villagers) 
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(4) c. Wutian qing shou, baijing, tianting baoman, shouzhi shuicong yiyang. 

Wutian pretty thin fair and clean forehead plump finger water onion same shape 
‘Wutian is pretty thin and looks fair and clean. His forehead is plump. His fingers are 
just like water onion.’ (followed by an account of how the local people accepted 
him) 

d. zhiqing gang xia lai de shihou, cunliren you xinxian gan, 

educated youth just descend come nom time villager have fresh interest 
you shi mei shi dou xihuan zuan zhiqing wu zuozuo, 

have thing not have thing all like go into educated youth rooms sit a while 
‘When the educated youths first came, the villagers had great interest. Whether or 
not they had anything to do (with the educated youths), they all liked to go into 
their rooms and sit for a while.’ (followed by an account of how the young man 
Wutian had his first romantic experience with a local girl) 

(5) a. dao Changdu hou, wo qing xian renshi ju zhang 

reach Changdu after I ask county personnel bureau director 

Xiangba zuo fanyi, ta xinran daying. 

Xiangba to be interpreter he gladly agree 
‘After (we) arrived at Changdu, I asked the director of the personnel bureau of the 
county to be my interpreter. He gladly agreed.’ (followed by a description of the 
condition of the road in Tibet) 

b. chu Changdu bu yuan, jiu likai le Chuangzang gonglu de zhugandao. 

leave Changdu not far soon leave pfv Sichuan-Tibet road nom main line 

‘Soon after we were out of Changdu, (we) left the main line of the Sichuan-Tibet 
road.’ (followed by a description of the condition of the road ahead) 

c. bangwan shifen, women dao le Caiwei qu. 

evening time we reach pfv Caiwei district 

‘In the evening, we arrived at Caiwei district.’ (followed by a description of the place 
where the district administration is located) 

d. quli zhi you jige ren zai. quli 

district administration only have a few people remain district administration 

de lingdao dao xianli kai hui qu le. 

assoc officials to county hold meeting go pfv 

‘There are only a few people left at the district administration. The officials of the 
district administration have gone to the county to attend a meeting.’ (followed by 
an account of their experience of having dinner at the local government) 

e. chi le wanfan, Gesang zai yuanzi li ca che, 

eat pfv supper Gesang at courtyard in clean car 

wo he Xiangba sanbu zou chu. 

I with Xiangba take a walk go out 

‘After we had supper, Gesang was cleaning his car in the courtyard. I and Xiangba 
went out in the manner of taking a walk.’ (followed by an account of visiting a 
local Tibetan in his house) 

In the analysis of the Chinese high-rate group sentences, (4) a represents a case 
where the subject noun phrase (modified by a relative clause) introduces a new 
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Syntactic Categories 

Functional Categories 

Temporal 

Discontinuity 

Spatial 

Discontinuity 

Participant 

Discontinuity 

Adverb 

3 



Prepositional Phrase 

5 


4 

Participial Phrase 



5 

Noun Phrase 

3 

2 


Subordinate Clause 



2 

Total 

10 

2 

11 


Table 2. Sentence-initial structures expressing different types of discontinuities (based 
on sentences in the Chinese high-rate group: 22/23=96%). 


participant into the story. (4)0 is a case of subject noun phrase serving the same func¬ 
tion but has no modification. In cases where the subject noun phrases are modified, 
either by a relative clause or a genitive phrase, the modifying structures often provide 
the necessary background information or convenient reference point for the intro¬ 
duction of the new participant. (5)d is an existential sentence. An existential sentence 
in Chinese such as (5)d starts with a location, which is conceptualized as a possessor. 
What exists at this location is presented as the possessed. Despite this, we do not tabu¬ 
late this initial noun phrase of location as a structure involving a change of location 
because in these circumstances the location has already been activated in the previous 
episode. Instead we treat the whole sentence pattern as a special form for introducing 
new participants, which, unlike the subject noun phrases in normal sentences, comes 
at the end of the sentence. Following Lambrecht (1988:15-16), I regard the sentence- 
initial noun phrase of location as serving the function of anchoring the new referent 
in the discourse. 

Temporal discontinuity is also expressed in various ways. (4)b and (5)0 are 
instances of noun phrases indicating major changes in time. As a subordinate clause, 

(4) d and (s)e each provides specific information of time relating to a specific event. 

(5) a and b are both subordinate clauses of time in form, though (5)b contains no tem¬ 
poral conjunction. However, the changes in place indicated by the verb phrases in 
both clauses seem to be more informative than the times in relation to them. Based on 
this reasoning, (5)a and b are categorized as two cases of subordinate clauses respon¬ 
sible for spatial discontinuities. 

Table 2 records the correspondences between initial structures of the 22 sentences in 
the Chinese high-rate group and the specific discontinuities they indicate. The Chinese 
high-rate group consists of 23 sentences. Among the 22 sentences recorded in Table 2, 
one sentence contains two initial structures—as a result, 23 tokens are listed. One sen¬ 
tence in the Chinese high-rate group is categorized as a case of thematic discontinuity. 
Because it is the only case of thematic discontinuity it is treated on its own. 

From the above discussion we can see that most of the sentences in the Chinese 
high-rate group are found to be characterized by major changes in time, place, or 
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participant. Except in the two existential sentences, the change in participant is con¬ 
sistently expressed by the subject noun phrase at sentence-initial position. Changes 
in time and place, on the other hand, are expressed by such sentence-initial structures 
as adverbs, noun phrases, and subordinate clauses. The expressions of these changes 
in the Chinese data are quite compatible with those observed in the English data in 
the sense that as carriers of information about important changes in time, place, or 
participant they need to be placed right at the very beginning of an episode (unless a 
special sentence pattern such as existential sentence or cleft sentence is used). 

In the analysis of the English data, the sentence-initial structures responsible 
for indicating changes in time and place are characterized as marked (in com¬ 
parison with their canonical positions at the end of the sentence). In the case of 
Chinese, however, the same structures regularly occur at sentence-initial position 
as ‘topics’(Chafe 1976: 50-51; Li & Thompson 1981:94-95) since Chinese is a 
topic-prominent language (Li & Thompson 1981). The functional constraint that 
the information about important changes in time and place is crucial for the 
introduction of a new episode provides a good perspective for similar syntactic 
behaviors from two syntactically very different languages. 

3.3. COMPARING HIGH-RATE GROUPS WITH LOW-RATE GROUPS. Although my 

investigation was focused on sentences in the high-rate groups I also had a close look at 
sentences identified with 40%-65% agreement rates in the low-rate groups of the two 
languages. What I find is that sentences in the low-rate groups are also characterized 
by the three types of discontinuities. But they are indicators of local changes within the 
contexts of episodes set by the transitional sentences of the high-rate groups. Another 
finding is that many sentences in the low-rate groups may be characterized as intro¬ 
ducers of new themes. Considering this fact and the fact that only one case of thematic 
discontinuity was identified in the high-rate groups, sections specifying certain themes 
seem more likely to be regarded as embedded units within episodes. 

4. summary. As the above analysis shows, the result of the segmentation task supports 
the hypothesis of this study. In more specific terms, the result may be summarized as 
follows: 

First, in identifying episode transitions both English- and Chinese-speaking subjects 
used the same strategy: they looked for sentences that involve changes in time, place, or 
participant. This strategy indicates that these changes are natural indicators of episode 
transitions in narrative texts and as such they are perceived by language users. 

Second, episode transitions are primarily defined by major changes in time, place, 
or participant. The low-rate cases typically involve local changes in these dimensions. 
The lesser agreement in identifying the low-rate cases implies that they represent less 
clear cases of episode transitions. 

Third, in episode-initial sentences the changes in time and place are typically 
encoded in preposed structures whereas the changes in participant are often indi¬ 
cated by subject noun phrases. 
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Finally, the result of the experiment has made us aware that if changes in time, 
place, or participant generally correlate with episode transitions, it means that the 
discourse elements of time, place, or participant are also important to our concep¬ 
tion of episode as a coherent unit. In other words, our sense of episode as a the¬ 
matically coherent unit is derived from our evaluation of the roles played by these 
elements in the formation of the mental representation of an episode. This evalua¬ 
tion is cognitively based in the sense that we (together with other people) engage in 
many different activities in different periods of time at different places every day and 
our common view as to which activities should belong to one rather than another 
‘episode of our daily life seems to rely heavily on the parameters of the time or place 
of the activities as well as who participate in them. 


Abbreviations: assoc - associative; cl - classifier; nom - nominalizer; 
pfv - perfective aspect 
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departing from everyday usage is a nearly universal phenomenon for poets, 
whether we call it poetic style or poetic language. This can go much further when the 
words are set to music, as I will show in examples from well-loved masterpieces. 

1. the first one is by Gilbert and Sullivan (1878), Sir Joseph Porter’s song in H.M. S. 
Pinafore: 

When I was a lad I served a term 
As office boy to an attorney’s firm. 

I cleaned the windows and I swept the floor, 

And I polished up the handle of the big front door. 

(chorus: He polished up the handle of the big front door.) 

I polished up the handle so care-ful-lee 
That now I am the ruler of the Queen’s Na-vee. 

(chorus: He polished up the handle to carefullee 
That now he is the ruler of the Queen’s Navee.) 

In this ‘patter’ song the vocabulary is plain, unpoetic, and the meter purposely uneven: 
artfulness is therefore displayed all the more in the rhyming of the couplets. Then, 
in the last couplet, the world ruler —though a little on the literary side—is a little less 
fancy a title than First Lord of the Admiralty would be with its archaic sound, and 
the Queen’s Navy is an up-to-date substitute for the Admiralty. But only a comedian 
would rhyme ‘Navee’ with ‘carefullee’; the unstressed -y in a two-syllable word cannot 
otherwise be treated as a rhyming vowel 1 . 

However, the rhyme carefullee : Navee is quite tame, compared to the one in the 
third stanza (and thereafter) 2 : 

In serving writs I mades such a name 
That an articled clerk I soon became; 

I wore clean collars and a bran new suit 
For the pass examination at the Institute 
(chorus: For the pass examination at the Institute.) 

That pass examination did so well for me 
That now I am the ruler of the Queen’s Na-vee. 
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(chorus: That pass examination did so well for he [!] 
That now he is the ruler of the Queens Navee.) 


Nothing in the English language would bring on ‘so well for he’, except a choral refrain 
echoing ‘so well for me. Truly a trick of genius, however outrageous. 

The idea of taking such liberties occurred to Gilbert and Sullivan perhaps from 
older but lesser versifiers in the English theater, whose ephemeral compositions were 
soon forgotten. But another influence came through Gilbert’s own labor, when he 
dipped into the libretti of Italian operas. For there was a sizable public in London 
for opera, both in the original and in translation. Unlike the fictitious Joseph Porter, 
who grew rich in the practice of law, the real man Gilbert after passing the bar could 
not earn a living at it; eventually he put his legal knowledge to better use in numer¬ 
ous scenes of his comic operettas 3 . Meanwhile he found some odd jobs by writing an 
English text based on operas already popular on the Continent. Elis first minor suc¬ 
cess in that line was a translation of a French soprano aria from Manon Lescaut by 
Auber 4 . After that, but before he began collaborating with Sullivan, he wrote Dulca¬ 
mara (1866), a take-off on Donizettis comic masterpiece L’Elisir d’amore, which had 
become a familiar classic 5 . 

As is well known, Gilbert and Sullivans operettas often parody the style of Italian, 
French, or German opera—tragic opera still more than comic. This extends, as I 
maintain, to details of meter and musical diction. I have not discovered an exact Ital¬ 
ian or other precedent for rhymes like ‘so well for he /.. .the Queens Navee’; but many 
devices of Italian grand opera did attract these talented Englishmen and are further¬ 
more worth appreciating in their own right, even apart from the use that Gilbert and 
Sullivan made of them. For from European culture of the nineteenth century noth¬ 
ing has proved more enduring than opera, nor broader, more nearly universal in its 
human appeal 6 . 


11. words such as carefully, stressed on the antepenultimate syllable, are less 
important in English than in Italian versification, where they are termed voci sdruc- 
ciole (literally ‘slippery words’). For according to the favorite meters of lyric poetry, 
the opening line of a stanza or strophe ends in a voce sdrucciola, and some or all of 
the odd-numbered lines thereafter end in another voce sdrucciola, non-rhyming as 
a rule 7 ; e.g., Leonora’s cavatina in II Trovatore (Act I, scene 2) by Verdi: 


Tacea la notte placida 
e bella in ciel sereno; 
la luna il viso argenteo 
mostrava lieto e pieno. 


(‘Still was the night, placid 
and fair in a sky serene; 
the moon her silvery face 
showed gay and full...’) 


The two unstressed syllables of placida, and again -teo in the third line, bring out 
poetically the continuing suspense, as the stanza is not yet ready to come to a close. 
The composer too liked, still more, to dwell upon those syllables 8 . 
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Later on, in Leonoras half of the frenzied duet with the villainous Count di Luna 
(Act IV, scene 1), Verdi gave a higher pitch to the final syllable of each voce sdrucciola: 

Mira, di acerbe lagrime (‘Look, of harsh tears 

spargo al piede tuo un rio. I shed at thy 9 foot a stream. 

Non basta il pianto? Svenami, Weeping is not enough? Stab me, 
ti bevi il sangue mio. drink up my blood. 

Calpesta il mio cadavere Trample my corpse 

ma salva il trovator. but save the troubadour.’) 


The untoward emphasis upon the last syllable of lagrime ‘tears’ after the normal stress 
upon la-, adds immensely to the effect 10 . Even more, svenami pours out the heroine’s 
anguish. For normally the suffixed pronoun -mi would be weak, as in baciami ‘kiss 
me’—whereas to give emphasis, ‘kiss me’ (not someone else), calls for bacia me, using 
the independent form of the object pronoun. But svena me ‘stab me’ would in any case 
be incompatible with the meter at this point of the stanza; nevertheless, svenami com¬ 
municates most pathetically, as a voce sdrucciola but with -mi sung on a prolonged 
F after svena- on E flat. Music triumphs over the rules of the Italian language 11 . How¬ 
ever, when the word is repeated five times in the rest of the duet, -mi resumes its 
normal phonetic status 12 . 

The liberty that Verdi took with his national language, by varying the stress upon 
svenami, had one supreme precedent: Mozart’s ‘Alleluja in the motet Exsultate, jubi¬ 
late. This, to be sure, is a very special case; for in Church Latin, and before that in the 
Greek of the Septuagint, the Hebrew expression rP ibbn (HLLW YH} had always been 
treated as untranslatable, even though the meaning is clear enough in Hebrew: the 
verb ‘Praise ye’ with its object the shorter form of the divine name (which otherwise 
is written as the Tetragrammaton mn 1 {YHWH}) 13 . Moreover, the ancient synagogues 
whose main language was Greek cherished this one familiar bit of Hebrew from the 
Psalms. Thereafter the Greek and other Christian churches carried on the chanting of 
Psalms, and thus Alleluia spread in time to every European language. Accenting the 
vowel [u] in the penult of this long foreign word was congenial to Greek, and likewise 
to Latin; but it lacks the sort of inherent fixity that belongs to a native word such 
as fortuna or pictura. So nothing blocked Mozart’s impulse to move right on from 
Alleluja to Alleluja, and back and forth many more times 14 . 

Verdi exercised a different sort of license to suggest a rhetorical tone in La Traviata 
(Act II, scene 1), where the elder Germont, the father of Violetta’s lover, remonstrates 
with her to give him up for the sake of his family: 


Pura siccome un angelo 
Iddio mi die una figlia. 

Se Alfredo nega riedere 
in seno alia famiglia, 
l’amato e amante giovine 


(‘Pure as an angel 
God gave me a daughter. 

If Alfred will not come back 
into the bosom of the family, 
the beloved and loving youth 
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cui sposa andar dovea 
or si ricusa al vincolo 
che lieti ne rendea. 

Deh non mutate in triboli 
le rose dell’ amor. 

A’ prieghi miei resistere 
non voglia il vostro cor. 


whose bride she was to be 
now draws back from the bond 
that made us happy. 

Pray, do not change into thorns 
the roses of love. 

To withstand my pleas 
let not your heart wish.’) 


The natural stress upon the first syllable of pura sets a pattern for [’iddio] instead 
of the normal [idd'io]. Similarly se AI-, which counts as one syllable metrically, gets 
the beat that would otherwise come on the middle syllable of Alfredo. Then lamato 
is handled just like Iddio; and le rose also with unusual stress upon the article. But 
Verdi applies the device flexibly, not to each and every verse; and indeed, when ‘le 
rose dell’ amor’ is repeated, the natural pronunciation le rose reasserts itself 15 . For here 
the speaker shifts from deliberate argument to a heart-to-heart appeal; the composer 
enhances it by inserting in the last verse no, no before non voglia —regardless of the 
poetic meter—and again at the very end. 

The closing scene of II Trovatore shows how the poetic libretto—before the com¬ 
poser took over—played neatly upon two contrasting infinitives, as the poison that 
Leonora had swallowed seizes her and she says to her truelove Manrico: 

Prima che d’altri vivere (‘Sooner than to live as another’s 

io volli tua morir. I wished to die thine.’) 

The morphology of Italian, or of the standard Tuscan dialect, came with vivere, a voce 
sdrucciola, and morire, a voce piana (a ‘plain word’—i.e., stressed on the next-to-last 
syllable). The vowel at the end of either infinitive may be dropped in a non-final 
position. Moreover, the special rules of lyric poetry require at the end of a stanza a 
voce tronca (a ‘clipped word’) with a final accented vowel or else with a liquid or nasal 
consonant 16 . So the poet Cammarano reinforced the opposite meaning of the infini¬ 
tives by putting vivere at the end of one line and balancing it with the clipped morir. 
Verdi went further, spreading vivere over six notes, from C up to A flat, and then limit¬ 
ing morir to two notes: mo- on the same C but -rir down to B flat. 

in. wagner, writing as a german critic of the Italian composers, censured them 
for distorting the native sound of the words so as to fit the beat of the music 17 . This, 
however, was due in part to a long-established principle of Italian poetry, that the 
meters should be flexible within limits; e.g., an iambic verse allows the substitution of 
a trochee in one or two measures, except at the end, and thus the pattern from verse to 
verse varies subtly and pleasantly. But when combined with a tune, the words become 
subordinate; if the wrong syllable, here and there, gets the beat, it scarcely hurts our 
enjoyment of the song. In this cabaletta from Don Pasquale by Donizetti: 
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So anch’ io la virtu magica 
dun guardo a tempo e loco; 
so anch’ io come si bruciano 
i cori a lento foco. 

D’un breve sorrisetto 
conosco anch’ io l’effetto, 
di menzognera lagrima, 
d’un subito languor... 


(‘I too know the magic power 

of a glance at the [right] time and place; 

I too know how hearts burn in slow fire. 


Of a brief little smile 
I too know the effect, 
of a false tear, 


of a sudden faint...) 


the rollicking tune, in the third line, slides past come ‘how’ as though it were come. In 
any case the melody suits the mood handsomely; I cannot wish that Donizetti or his 
librettist had thought of something to obviate this mild irregularity, a trochaic second 
measure instead of an iamb 18 . 

Music in Germany was less inclined than in Italy to cause a shift from one syl¬ 
lable to another, because of a basic phonological difference between the languages. 
Every syllable in Italian has a full vowel, whereas German words of two or more syl¬ 
lables have one with a weak vowel; therefore to stress the second syllable of komme or 
kommen ‘(to) come’, as was done in the Italian [kom'e], would be grotesque. Neverthe¬ 
less, German compounds, which have a minor stress upon one syllable of the second 
component, lend themselves to uneven treatment in song. Schubert, before Wagner’s 
time, took advantage of this in his setting of Goethe’s ballad Der Erlkonig. Since the 
poet had varied the compound between three syllables and four ( Erlenkonig ): 

Siehst, Vater, du den Erlkonig nicht (‘Seest thou not, father, the Erlking? 
den Erlenkonig mit Kron und Schweif? the Erlenking with crown and tail?’) 19 

—with or without the definite article: 

Mein Vater, mein Vater, und horest du nicht (‘... and hearest thou not 

was Erlenkonig mir leise verspricht? what Erlenking softly promises me?’) 

the composer was entitled to a further variation, weakening the stress on the first syl¬ 
lable and placing the beat instead right after it: 

Mein Vater, mein Vater, und siehst du nicht dort (‘.. .and seest thou not there 
Erlkonigs Tochter am diistern Ort? Erlking’s daughters in the 


gloomy place?’) 


Metrically the verse with Erlkonigs is quite parallel to this sequel: 


Mein Vater, mein Vater, jetzt fa fit er mich an! (‘.. .now he grabs me! 
Erlkonig hat mir ein Leids getan! Erlking has done me a hurt!’) 
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but here Schubert returns to the normal stress. 

With his sense of drama he was free to exploit such possibilities of his language, 
somewhat as Verdi or Donizetti did in working with an Italian text. Schubert added 
the power of musical pitch to build toward a climax: The boy’s first utterance begins 
on C, his next couplet on D, the third on E, and the last one on F as his terror keeps 
mounting. The music prepares us emotionally, more than words alone could, for the 
closing sentence: ‘In seinen Armen das Kind war tot’ (‘In his arms the child was 
dead’). 


iv. my final example is from a French tragic opera, Les Pecheurs de perles by Bizet, 
where—in a manner quite different from Gilbert and Sullivan—the rhyme that ought 
to close the stanza gets shifted so as not to make it the last word: 


Je crois entendre encore 
cache sous les palmiers 
sa voix tendre et sonore 
comme un chant de ramiers. 
O nuit enchanteresse, 
divin ravissement! 

O souvenir charmant, 
doux reve, folle ivresse! 

Aux clartes des etoiles 
je crois encor la voir 
entr’ouvrir ses longs voiles 
aux vents tiedes du soir. 

O nuit enchanteresse,... 


(‘I seem to hear again, 
hidden under the palm trees, 
her voice tender and tuneful 
like a song of wood doves. 

O enchantress night, 
divine rapture! 

O charming memory, 
sweet dream, mad inebriation! 

By the twinkling of the stars 
I seem again to see her 
half-opening her long veils 
to the warm evening breezes. 
O enchantress night,...) 


The refrain has folle ivresse rhyming with enchanteresse; but the score changes the last 
line to end with doux reve, which does not rhyme with anything but expresses better 
the mood of the hero Nadir. Although the French noun ivresse does not convey the 
same moral disapproval on the part of the speaker as its literal equivalent drunken¬ 
ness in English, still the composer was wise to depart from the order of words given 
by the librettist 20 . For as the lover reminisces, his senses recapture a delicately blended 
revery; the exquisite sexual hints float down on him one by one. Some other com¬ 
poser, responding to the same words, might have scored them to make folle ivresse not 
only the very climax but the very last word; however, the melody that came to Bizet’s 
ears ends more gently 21 . 

Music and poetry are there to be enjoyed, and appreciated without requiring an 
analysis of the art that has gone into them. But if our intellectual curiosity is aroused, 
it can contribute something positive, explaining how the human voice fmds the ways 
to deliver its message beautifully, and thus to surpass the ordinary level of communi¬ 
cation through language 22 . 
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1 The vowel at the end of carefully might receive a secondary stress, being distant enough 
from the stress on the opening syllable, and would thus make an acceptable rhyme with 
[- 1 :] even in serious poetry.— Nauie or nauye in Middle English kept the stress on [i] as in 
Old French; Chaucer, Legend of Good Women (960); 

With shippes seven and with no more nauye, 

And glad was he to londe for to hye... 

It is conceivable that the humorist Gilbert consciously harked back to that. 

2 Here is the second stanza; As office boy I made such a mark 

That they gave me the post of a junior clerk. 

I served the writs with a smile so bland, 

And I copied all the letters in a big round hand. 

(chorus: He copied all the letters in a big round hand.) 

I copied all the letters in a hand so free, 

That now I am the ruler of the Queen’s Na-vee. 

(chorus: He copied all the letters in a hand so free, 

That now he is the ruler of the Queens Navee.) 

3 E.g. the Lord Chancellor’s song in lolanthe: The Law is the true embodiment 

Of everything that’s excellent. 

It has no kind of fault or flaw. 

And I, my lords, embody the Law. 

The irony is all the more delicious if we happen to know that this librettist had been a 
barrister. A later stanza, about the Lord Chancellor functioning as guardian of wards in 
Chancery: And everyone who’d marry a ward 

Must come to me for my accord, 

And in my court I sit all day 
Giving agreeable girls away, 

With one for him—and one for he— 

And one for you—and one for ye— 

And one for thou—and one for thee— 

But never, oh never a one for me! 

exploits, as in Pinafore , the rhyming of pronouns contrary to all normal syntax. 

4 Auber s operatic setting (1856) of the novel by Prevost was destined to lose out eventually 
to Massenet’s (1884)—besides the one in Italian by Puccini (1893). 

5 Gilbert’s title came from the quack doctor Dulcamara (= ‘Bittersweet’, an herb), who sold 
all kinds of elixirs and other nostrums. Gilbert gave most of the characters the same 
names as in the opera itself, but he took over just one of Donizetti’s airs; the rest he got 
from various other sources. 

6 Just in recent years, when the Chinese opened their minds wide to the attractions of 
the Occident, the audiences have reported that La Traviata touches their emotions more 
deeply than any musical drama from their own tradition. 

7 Dr. Dulcamara’s spiel, however, is full of voci sdrucciole such as these: 

Per questo mio specifico, (‘Through this my specific, 

simpatico, prolifico, sympathic, prolific, 

un uom settuagenario a man, septuagenarian 

e valetudinario and valetudinarian, 

nonno di dieci bamboli became again a grandfather of ten babies.’) 

ancora divento. 
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The music also is somewhat reminiscent of the style of English‘patter’ songs: every syllable 
from Per to the end of valetudinario is on the pitch of A, with calculated monotony. 

8 From a jolly English operetta it may be a disconcerting shift to ll Trovatore, where even the 
music of love is so somber. But the plot of Pinafore was actually derived by Gilbert from ll 
Trovatore with grotesque precision: a hero deprived in infancy of his rightful place among 
the nobility, through the gypsy Azucena; her comic counterpart, however preposterous, is 
Little Buttercup. 

9 For the English glosses I resort to the archaic second person singular, because Salvatore 
Cammarano, Verdi’s librettist, makes all the characters consistently darsi del ‘tu (as the 
Italians put it), regardless of any difference in rank or emotional state. He chose not 
to adhere to the usage of his source, the Spanish play El Trovador by Antonio Garcia 
Gutierrez, where Leonor at the beginning of the scene addresses De Luna in the second 
person plural,‘ jMe conoceis, Conde?’ (‘You know me, Count?’) but afterwards shifts to the 
singular: ‘Y tu en premio a mi ternura...’ (‘And thou as a reward for my tenderness...’) 

10 Here I show lagrime with two accents. Italian orthography, however, forbids the writing 
of any accent on lagrime or the singular lagrima, or any other voce sdrucciola. Spanish 
orthography, on the contrary, requires an accent on lagrima(s). The term voces esdrujulas 
was taken from Italian but modified in accord with Spanish phonology. 

11 Di Luna’s part of the duet matches Leonora’s metrically, as usual in opera; but since his 
words—and feelings—clash utterly with hers, 


Ah! dell’ indegno rendere 


(‘Ah! to render the worthless one’s 
fate worse I would like 
amid a thousand dismal wrenchings 
to make his death a hundred fold. 

The more thou lov’st him, the more terrible 
blazes my wrath!’). 


vorrei peggior la sorte 
fra mille atroci spasimi 
centuplicar sua morte. 
Piii l’ami e piu terribile 
divampa il mio furor! 


Verdi sets them to a quite different melody. In particular, all three syllables of rendere, spa¬ 
simi, and -ribile come out on one pitch, middle C. At the close of the duet, when Leonora 
and Di Luna together sing their last two verses, she repeats the same tune as before; and he 
sings ‘Piu l’ami e piu terribile’ exactly an octave lower than her ‘Calpesta il mio cadavere’, 
except that the final syllable of his terribile drops to D flat while that of cadavere rises to F. 

Although Verdi’s melodramatic effects surpassed than anything earlier, Donizetti had 
already demonstrated the value of raising the pitch at the end of a verso sdrucciolo, as in 
Una furtiva lagrima (‘A stealthy tear 

negli occhi suoi spunto; trickled in her eyes; 
quelle festose giovani those cheerful girls 

invidiar sembro:... she seemed to envy...’ L’Elisir d ’amore, Act II, scene 8). 

12 That includes two instances of Mi svena, the syntactical equivalent of Svenami. 

13 For example, when the brief Psalm 117 (116) was translated into Greek, the opening words 
mrP ns lVVn were routinely translated cuveIte tov Kijpiov ‘Praise ye the Lord’, with the 
common noun subsituting for the name itself, just as a Hebrew reader in that era would 
pronounce the noun TIN [ ? adoniy] ‘my Lord’ instead of [y-h(-)w- h ] (vowels uncertain). 
But the closing words of the Psalm, n 1 lV?n, were just translated dXXqXomcc. The rule 
making it a sacrilege to pronounce mrP did not in principle apply to rp; and the most 
authoritative pronunciation, [yoh] survived into the medieval (and modern) tradition of 
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Scripture reading. Neverthelss, in Greek the substitute Kijpiov (nominative Kijpioc;) was 
employed to translate rp (Ps. 94 [93] :7,118 [117] :8, etc.), even when some other form of the 
verb 55 n ‘praise’ accompanies it: Thus NTil D 571 but rai Xaog 6 Kti^opevoi; aiveaei 

tov Kupiov ‘and a people being [now] created will praise the Lord’ (Ps. 102 [101] :19b Only 
in the formula m l 55 n was the two-letter divine name (with the Greek pronunciation [ia] 
or [ya] exempt from being replaced by Kupi-. 

14 It might have encouraged him all the more, if he happened to learn from some Jewish 
informant that the authentic Hebrew accent belongs at the end, where it gives due promi¬ 
nence to the object of the verb ‘praise’. 

After my oral presentation in Houston, my friend Barbara Baez gently reminded me 
that Handel too (earlier in the 18th century) had dwelt upon the shift of beat in the ‘Hal¬ 
lelujah’ chorus of The Messiah. Now checking the score, I perceive that at different points 
he gave the beat to each of the four syllables, but much more often it comes on -lu- than on 
any other place. Mozart himself played at least some of the music from this great oratorio 
and introduced here and there his own contrapuntal additions into the accompaniment. 
I must leave it to the musicologists to prove whether he was already acquainted with The 
Messiah when he composed Exsultate, jubilate. (Handel’s restoration of the initial conso¬ 
nant H- and the final -h was in compliance with the practice of English Protestants since 
the 16th century, when a number of scholars from England studied Hebrew in Germany. 
The spelling rules of English suggested to them that many Hebrew words in the English 
Bible, especially names, could be shown more accurately than had been done in the Latin 
Vulgate.) 

15 In the repetition it was more graceful to replace the insistent particle Deh with the rela¬ 
tively neutral or pathetic ah. 

16 The latter option indeed contravenes the phonology of Tuscan otherwise; for the few short 
words such as per and non are precursory and never end a sentence or a clause. Nonethe¬ 
less, since the lyric poetry in southern Italian dialects, as well as song, was free of such 
constraints and had been so warmly received throughout Italy, the poets writing in stan¬ 
dard Italian arrived at a compromise: in final position too, the voci piane can (with some 
exceptions) drop their final vowel, if preceded by l, m , n, or r. 

17 To Bellini alone, Wagner was more favorable than to the other Italians; see Hans-Joachim 
Bauer, Richard Wagner Lexikon (Bergisch Gladbach: Gustav Liibbe, 1988), 68-70. But I 
have noticed that Bellini too did occasionally sacrifice the proper accentuation, as in Pol- 
lione’s cavatina in Norma (Act I, scene 2): 

Udia d’lmene i cantici, (‘I heard the chants of Hymen, 

vedea fumar gl’incensi,... I saw the incense smoking ,...) 
where the normal stress would be on [-d'i-] in udi(v)a and on [-d'e-] in vede(v)a. 

18 By the ordinary sandhi of the Italian language, the noun and adjective virtu magica in the 
first line go so well together as to cancel or reduce the stress that would belong to [-t'u] in 
any environment other than an adjective such as [m'a-]. No need here to invoke poetic or 
musical license. 

19 My English gloss keeps the German variation between Erl- and Erlen-. But for semantic 
accuracy both Erlkonig and Erlenkonig should be glossed ‘Alderking’; the common noun 
Erie means ‘alder’. The credit for this goes to my friend Martin Bidney, professor of English 
at SUNY-Binghamton. 

20 The weak vowel [-a] at the end of ivresse comes on a high B, startlingly higher than the 
previous syllables and recalling the mild elevation of [-a] in enchanteresse. The high B 
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is strengthened by a pause, which marks the sharp contrast between ‘folle ivresse’ and 
what follows: ‘doux reve’; for the [-a] of reve is sung on A an octave lower, after the gentle 
descent of re- from C through B and A down to G, and this A makes a welcome pitch to 
rest upon. 

21 At the end of the second stanza, the score after ‘folle ivresse, doux reve’ repeats the previ¬ 
ous line ‘Charmant souvenir’ with the adjective and noun in reverse order, and here too 
the rhyme with ravissement is disregarded. 

To me this aria has personal meaning. It was my father’s favorite, although he did not 
understand French. In my early years I heard it often on a Victrola record by Caruso. The 
old records, along with the phonograph, were abandoned when the family moved; and 
only once in a while was I reminded of the arias that Caruso had made familiar. But then 
in November of 1945, when I was sailing home from Naples on a slow troopship, this aria 
from Les Pecheurs deperles was the only classical piece on the program of recorded music 
twice a day out on deck. The tenor Richard Crooks sang it, for some reason, in Italian, 
‘Mi pare udire ancora’; I listened to his rendition dozens of times. Years later I bought a 
long-playing record of Caruso arias and reacquainted myself with every verse of this one. 
(From the third verse I have lately taken the password for my e-mail.) 

22 I am grateful to my old friend Angela Della Volpe in LACUS and to my dear colleague 
Peter Cocozzella at SUNY-Binghamton for precious help with some fine points of Italian. 
It has also been for me a happy occasion to get acquainted with Bruce Borton, professor 
of music at SUNY-B; he has enabled me—however inexperienced—to record on a cassette 
tape the excerpts from Gilbert and Sullivan, Verdi, Mozart, Donizetti, Schubert, and Bizet. 


Portions of the scores of works cited, as well as brief recorded passages corresponding to 
the cited works are on the accompanying cd-rom. Additional copies of this material may be 
requested from: 

Saul Levin 

Department of Classical and Near Eastern Studies 
State Univerisity of New York 
Binghamton, New York 13902-6000 


e-mail: slevin@binghamton.edu 



EXPLICIT/IMPLICIT MODULARITY IN RIGHT HEMISPHERE PRAGMATICS 


Jennifer Mortimer 
McGill University 


introduction. Explicit Memory and Implicit Memory are dissociable and are 
subserved by different neurofunctional systems. They play complementary roles in 
supporting those aspects of language that are subserved by the Left Hemisphere: pho¬ 
nological, morphological and syntactic components of language are part of implicit 
competence, and non-rule governed lexical/semantic knowledge is largely supported 
by explicit memory. Modularity of linguistic functions in the LH is a theory grounded 
in more than a century’s worth of aphasic studies. In contrast, little research has been 
conducted regarding the possible dissociability of Right Hemisphere communicative 
functions (i.e., pragmatic functions: context-dependent and emotional aspects of lan¬ 
guage comprehension and production). In this paper I will discuss the hypothesis that 
implicit and explicit memory systems play dissociable and complementary roles in 
supporting not only components of left-hemisphere linguistic competence but also 
components of right-hemisphere pragmatic competence. The rationale for this paper 
will be based on the following points: first, that there is a double dissociation between 
explicit/implicit memory functions; second, that the explicit/implicit distinction is 
supported by clinical evidence; third, that different memory systems support different 
aspects of LH linguistic competence; fourth, that pragmatic competence is lateralized 
to the RH; and fifth, that there is clinical and experimental evidence to indicate the 
possibility of a dissociation between explicit and implicit aspects of pragmatics. 

1. explicit/implicit memory systems. Explicit knowledge is information that is 
consciously acquired and consciously used. It is our encyclopedic knowledge of the 
world—our memories of facts and events. In contrast, implicit competence refers 
to information that is acquired incidentally, is stored in such a way that it is opaque to 
introspection, is task-specific, and is used automatically (Paradis 1994). The abilities of 
implicit memory include the acquisition of skills and habits, non-associative learning, 
and simple classical conditioning (Squire & Zola-Morgan 1991). Accordingly, access 
to explicit knowledge is implicit, since the workings of our conscious minds are not 
consciously controlled. 

The neurofunctional model used in this paper to describe the distinction between 
explicit and implicit memory is based on aspects of Damasio’s theory of feature frag¬ 
ments that are compatible with Hebb’s cell assembly model (Damasio 1989). In this 
model, long-term memories are represented in the neo-cortex in the form of networks 
of locally connected neurons. Individual neurons within the networks are located 
in areas of the primary and early association cortices that specialize in storing and 
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processing specific types of information: motor, auditory, visual, olfactory, and haptic. 
The substrate for the convergence of these cells into networks is maintained by the 
tertiary neo-cortices, as well as by certain structures of the Medial Temporal Lobes 
(including the hippocampus, the parahippocampal cortex, the entorhinal and perirhi¬ 
nal cortex), the Frontal Lobes, and the Basal Ganglia and Cerebellum. Sensory cortices 
are interconnected to the hippocampus, and motor and sensory cortices are intercon¬ 
nected to the Basal Ganglia, Cerebellum, and Frontal lobes (Damasio 1989). Clinical 
studies show that selective impairment to any of these sites typically results in the fol¬ 
lowing impairments: (i) damage to the neo-cortex will destroy the conceptual features/ 
cells which, bound together, constitute memories, thereby rendering these memories 
inaccessible either in an explicit or an implicit form (ii) damage to the Medial Tempo¬ 
ral Lobes (MTL) will result in anterograde and/or retrograde amnesia—the inability to 
acquire or recall explicit memories—indicating that the MTL is indispensably involved 
in the consolidation of long-term memories in the neo-cortex (Zola-Morgan & Squire 
1990; Alvarez & Squire 1994; Graham & Hodges 1997) (iii) lesions in the region of the 
frontal cortex known as Brocas area will impair implicit access to linguistic representa¬ 
tions, and lesions to the basal ganglia or to the cerebellum will impair use of implicit 
motor and timing skills (Willingham et al. 1995; Harrington et al. 1998). 

In an unimpaired adult speaker, implicit/explicit modularity of linguistic func¬ 
tion is represented in the following manner. Lexical-semantic knowledge (arbitrary 
meanings of words) is considered explicit since this information is learnt and used 
consciously (we are overtly aware of the meanings of the words we use). This infor¬ 
mation is maintained in widely distributed areas of the left temporal and parietal 
lobes. Damage to neuroanatomical substrates, such as the MTL, that are involved in 
the acquisition of explicit memories, prevents the learning of new lexical items. 

Conversely, phonology, morphology, syntax, and some aspects of semantic compe¬ 
tence (such as argument structure) are implicit. Among native speakers of a language 
the use of rules that organize phonemes and morphemes into grammatical utterances 
is automatic, and the precise form of these rules is impenetrable to conscious analysis. 
Implicit linguistic competence is subserved in the LH by the ventro-lateral frontal 
cortex, the basal ganglia, and the cerebellum. These structures are involved in the tem¬ 
poral, sequential organization of information. Damage to these structures typically 
induces grammatical deficits such as linguistic aprosody, the loss of function words 
and of rules governing affixation, and in some cases, the inability to comprehend and/ 
or produce syntactically complex sentences (Grossman et al. 1992; Kolk 1998). 

2. clinical evidence for dissociable memory systems. Much of the evidence 
for a double dissociation between implicit and explicit memory systems is derived 
from clinical studies. Patients with Alzheimer’s Disease, which involves damage to the 
temporo-parietal neo-cortex and the MTL, are impaired in the explicit recollection 
of memories and the acquisition of new knowledge (including lexical items and word 
meanings). Implicit motor and perceptual skills, and grammatical rules, are compar¬ 
atively unaffected (Ullman et al. 1996). The famous patient H.M., who underwent 
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bilateral ablation of the MTL, is unable to learn new facts (such as words, his current 
age), but can still master new motor skills (Milner et al. 1968). Damage to Wernicke’s 
area in the LH typically produces deficits such as anomia, word-finding difficulties, 
and semantic paraphasias. Grammatical competence is generally spared. Among 
those patients who show implicit but not explicit memory damage are patients with 
Parkinson’s Disease (linked to the inadequate production of dopamine in the basal 
ganglia, producing hypokinesia/suppression of movement) and patients with Hun¬ 
tington’s Disease (linked to neuronal loss in the inhibitory pathways of the basal 
ganglia, leading to hyperkinesia/excitation of motor activity). PD and HD patients 
retain memories of lexical items and of other facts, but incorrectly apply the gram¬ 
matical rules which govern the use of affixes and function words (Ullman et al. 
1996). Damage to the left inferior frontal lobe can result in Broca’s aphasia, which is 
marked by speech apraxia and agrammatism (omission or substitution of grammati¬ 
cal morphemes in obligatory contexts), and an impairment in the skilled organiza¬ 
tion of phonemes and morphemes into sentences. However, when lexical items can 
be accessed, they are accessed correctly. 

3. right hemisphere pragmatic competence. Interest in the right hemisphere’s 
contribution to communicative competence has waxed and waned since Broca first 
proposed LH lateralization of language in normals. Hughlings Jackson noted that 
some aphasics retain the ability to utter swear words under emotional duress, which 
led him to suggest a distinction between propositional speech (the responsibility of 
the left hemisphere) and non-propositional speech (the responsibility of the right) 
(Jackson 1879). The latter refers to ‘automatic’ speech, including emotional exclama¬ 
tions. Pierre Marie (1906) remarked directly on the preserved pragmatic competence 
of left hemisphere damaged patients, saying that the ‘voluntary politeness of apha¬ 
sics’ was ‘additional proof of the conservation in their case of the affective and moral 
sphere’. These insights into the communicative capabilities of the right hemisphere 
were more or less ignored until the second half of the twentieth century. 

More recently, there has been research to suggest that the right hemisphere has 
a complementary, rather than a subservient, role in information processing (Sperry 
1974; Milner 1980). While the left hemisphere is said to specialize in sequential, con- 
text-independent analysis of information, the right hemisphere is holistic, visuo- 
spacial, context-dependent, and skilled at analyzing patterns (without analyzing the 
details of the patterns). 

In regards to communication, hemispheric specialization appears to take the form 
of LH dominance for the linguistic component of communication, and RH domi¬ 
nance for the pragmatic component. The LH’s responsibility is to derive meaning from 
utterances in accordance with a narrow set of inflexibly applied phonological, mor¬ 
phological, and syntactic rules. It is this part of language which is the focus of Chom- 
skian linguistic philosophy. The methodology that drives Chomskian-style linguistic 
analysis requires examining sentences in isolation; however, ordinary speech does not 
consist of a string of literal propositions. Consider the following exchange: 
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A to B: ‘I want you to read this book. I am sure you will enjoy it.’ 

B to A: ‘Yes, it looks fascinating.’ 

Taken by itself, there is only one possible interpretation of B’s statement: a literal inter¬ 
pretation. Nevertheless, if the utterance is placed in different contexts, different pos¬ 
sible interpretations for B’s comment become available. If person A is handing an 
exciting mystery novel to B, then a literal interpretation of B’s reply is appropriate. If 
person A is handing B a 6oo-page text on macroeconomics, then a non-literal transla¬ 
tion of B’s remark is likely to be correct (i.e., B is speaking ironically). In real-life situ¬ 
ations, we not only use literal meanings of words and sentences, we also commonly 
rely on emotional intonation, facial expression, emotional gestures, jokes, sarcasm, 
and irony to communicate efficiently with each other. While the Left Hemisphere 
analyzes the literal meaning of speech, the Right Hemisphere appears to be indispens¬ 
ably involved in processing non-literal aspects of communication (Shields 1991; Lorch 
et al. 1998). 

Rigorous testing of RH patients has highlighted language deficits that are con¬ 
sistent with this theory of lateralization of communicative functions. Studies have 
shown that difficulties with some aspects of prosody (changes in intonation indi¬ 
cating emotional states) generally co-occur with damage to the Right Hemisphere 
(Borod et al. 1998; Candour et al. 1995). RHDs may show deficits in understanding 
facial emotion (Blonder et al. 1991; Borod et al. 1998), affective gestures (Ross 1981), 
emotional words and sentences (Borod et al. 1992), sarcasm (Kaplan et al. 1990), figu¬ 
rative vs. literal meanings (‘He is turning over a new leaf’ as opposed to ‘He is losing 
his bad habits’) (Van Lancker & Kempler 1987), idiomatic expressions (‘Peter kicked 
the bucket’), jokes (the ‘punch-line’ is usually context-inappropriate), puns (which 
rely on a secondary context-inappropriate’ meaning), and irony (what is said is the 
opposite of what is meant—a fact which needs to be inferred from the context in 
which a sentence is spoken). RH patients also seem to have trouble drawing infer¬ 
ences from texts (understanding the theme of a story). In general, RHDs are deficient 
in grasping non-literal, implied meanings (the term for this condition is dyshyponoia 
(Paradis 1998)). 

There are numerous possible models to explain RH dominance in pragmatic abili¬ 
ties. One is that the RH specializes in the expression and recognition of affect (Ross 
et al. 1981). This theory would account for the apparent RH dominance for paralan- 
guage (emotional prosody, body language, and facial expressions). Another possibility 
is that the Right Hemisphere possesses a special ability for binding together a number 
of contextual cues to form meaning. Such an ability would explain how the RH is 
capable of using contextual information to comprehend non-literal meanings. Recog¬ 
nition of emotion can also be seen as being dependent on inference, in the sense that 
the perception of emotion requires the flexible interpretation of environmental cues. 
To understand RH processing, it may be necessary to take into account the contextual 
domain (discourse, situational or linguistic) from which inferences are made. Equally, 
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the nature of the subject about which inferences are made (i.e., the interlocutor’s state 
of mind) may be relevant. 

Unfortunately, although RH dominance for pragmatics is well-supported by clini¬ 
cal studies conducted over the past 20 years, there is little evidence to support any 
theory that might explain the manner in which pragmatic competence may be rep¬ 
resented in the RH. There are few studies, for example, which attempt to correlate 
RH deficits with intra-hemispheric lesion sites. In fact, studies rarely mention intra- 
hemispheric site of RH lesion, even though this information is usually available to the 
authors. 

In the following sections I will discuss pragmatic functions in terms of an explicit/ 
implicit memory systems modularity. I will base this programme on studies which 
have attempted to correlate lesion site with RH deficit. The criteria for determining 
which memory system supports a particular aspect of pragmatic competence will 
be the following: (i) explicit pragmatic knowledge must conform to the definition 
of explicit knowledge (i.e., it must be information learnt and used consciously), and 
implicit pragmatic skills must conform to the definition of implicit competence (i.e., 
skills acquired incidentally and used automatically and unconsciously), (ii) there 
must be clinical or experimental evidence to support classification of pragmatic func¬ 
tions as implicit/explicit. The evidence should show that RH substrates that anatomi¬ 
cally correspond with LH substrates supporting explicit linguistic functions in the 
left hemisphere (i.e., MTL) should be involved in explicit pragmatic functions in the 
right. RH substrates which support implicit memory (basal ganglia and cerebellum) 
and/or implicit linguistic competence (inferior frontal lobes or some other focal cor¬ 
tical site) should subserve implicit RH pragmatic functions. 

It must be emphasized that the neuroanatomical substrates mentioned above are 
believed to be indispensably, rather than exclusively, involved in supporting their 
respective memory systems. For example, damage to neo-cortical areas not directly 
involved in either memory system may impair a pragmatic function if this cortical area 
is responsible for processing the sensory information on which the function is based. 

4. prosody. Prosody consists of speech rhythm (timing and stress) and melody 
(pitch, tone and intonation). It has at least two functions: one is to support the pho¬ 
nological, morphological, syntactic and lexical elements of language (linguistic pros¬ 
ody), and the other is to convey affective information (emotional prosody) (Hird & 
Kirsner 1998). The most frequently examined acoustic correlates of emotional pros¬ 
ody are rhythm (expressed as timing and amplitude) and melody (expressed as varia¬ 
tions in fundamental frequency (FO)). Lateralization of prosody is generally said to 
occur along functional lines, with the left hemisphere dominant for linguistic prosody 
(Gandour et al. 1992a; Moen & Sundet 1996), and the right hemisphere for emotional 
prosody (Ross 1981). Although a claim has been made for LH specialization in the 
temporal aspects of prosody, and RH specialization in pitch and tone (FO), this theory 
is controversial (Gandour et al. 1992b; Niemi 1998). 
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RH affective prosody deficits (affective aprosody) are notoriously difficult to test 
without involving other RH pragmatic functions as base tasks. The tasks most com¬ 
monly used in determining affective prosody deficits require the subject to verbally 
describe the affective tone of a given utterance, or else to match the affective tone to a 
picture of a face representing the emotion expressed by the tone (e.g., happiness, sad¬ 
ness, etc.). Both emotional facial recognition and emotional words are believed to be at 
least partly mediated by the RH. An affective prosodic deficit therefore cannot neces¬ 
sarily be assumed to exist if the determining task involves an aspect of pragmatic com¬ 
munication that has not been controlled for. An affective prosody lateralization study 
by Ross et al. (1997) has highlighted the seriousness of this problem: Ross et al. found 
that reduced reliance on verbal processing to determine affective prosody deficits sub¬ 
stantially improves LHD but not RHD performance, which indicates that studies of 
affective prosody lateralization should require as little verbal processing as possible. 

5. is prosody implicit? The appreciation of prosody appears to correspond to the defi¬ 
nition of an implicit function: barring exceptional circumstances prosodic skill is used 
automatically, unconsciously and involuntarily (Cancelliere & Kertesz 1990; Hird & 
Kirsner 1998). Substrates supporting implicit skills should therefore be involved in the 
representation of prosody, and damage to these substrates should produce prosodic 
deficits. Damage to the basal ganglia in PD, HD and stroke patients has been found to 
co-occur with deficiencies in emotional and linguistic appreciation of prosody (Blonder 
et al. 1989; Cancelliere & Kertesz 1990; Speedie et al. 1990; Le Dorze et al. 1998). 

One would expect that, given that affective prosody is an implicit RH ability, the RH 
inferior frontal lobes and subcortical basal ganglionic structures in particular should 
be involved in supporting this function. Cohen et al. (1994) describe the case of a 
16-year-old right-handed patient who displayed flat, unanimated intonation after a 
stroke to the right putamen and the body of the right caudate nucleus. The patient 
showed ‘perfect comprehension of emotional gesturing’, but had difficulty in inferring 
emotion from affect-laden, pre-recorded speech, and was impaired at repeating and 
expressing affective intonation upon request. Propositional speech was unimpaired. 
Breitenstein et al. (1998), in a group study of RHD, LHD, PD patients and healthy con¬ 
trols found that patients with anterior RH damage ‘showed the strongest impairment 
in... prosodic emotional recognition and that PD patients in an advanced stage of 
the disease showed a similar deficit. The affective prosodic tasks used were two face¬ 
matching tasks, a naming task, and a discrimination task. Starkstein et al. (1994) in a 
group study of RHDs, LHDs, and healthy controls found that the patients with RH 
lesions involving the basal ganglia and temporal/parietal cortex were the most likely to 
show aprosody (T/P involvement may reflect the representation of the skill). Patients 
were asked to identify the emotion in a recorded utterance using a facial emotion 
matching task. An attempt to control for facial emotion processing was made by giving 
the subjects a facial emotion recognition task (most, but not all, of the aprosodic sub¬ 
jects performed poorly on this task as well). Infarction of the right thalamus and 
posterior limb of the internal capsule resulted in aprosody in the case of a 49-year- 
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old woman, as reported by Wolfe & Ross (1987). Spontaneous affective prosody was 
spared, but affective-prosodic repetition was impaired, as was affective prosody iden¬ 
tification in a verbal task. (Although left-handed, the patient was assumed to have a 
normal lateralization pattern, since comprehension and production of propositional 
language were intact.) Ross, Harney, deLacoste-Utamsing and Purdy (1981) describe 
a right-handed patient who possessed normal prosodic comprehension but impaired 
prosodic affective repetition and extremely poor spontaneous affective prosody after 
stroke. The site of lesion involved the posterior two thirds of the anterior limb, the 
entire genu, and the anterior third of the posterior limb of the rostral internal capsule 
(RH). Again, propositional speech was essentially undisturbed. 

6. sarcasm. The interpretation of sarcasm involves knowingly rejecting the literal 
meaning of an utterance in favour of some non-literal, usually contradictory, mean¬ 
ing. Understanding sarcasm requires the ability to make contextual inferences. For 
example, of the following statements: 

‘That’s a beautiful dress you are wearing’ (to someone wearing a faded, ragged dress) 

‘That’s a beautiful dress you are wearing’ (to someone in expensive, new clothes) 

the first has a possible sarcastic interpretation while the second does not. A sarcastic, 
non-literal interpretation of the first remark is cued by the discrepancy between the 
literal meaning of the remark and the meaning that would seem appropriate given the 
context. 

The appreciation of sarcasm is at least partly an explicit function. Its explicitness is 
evident from the fact that we can consciously identify the element in the situational 
context that makes the literal interpretation impossible (in the first example, the fact 
that the dress was faded and ragged). Moreover, in everyday conversation the compre¬ 
hension of sarcastic statements is not necessarily automatic: occasionally, the inter¬ 
pretation of a sarcastic comment requires an explicit explanation from the speaker. 

Deficits in processing sarcasm have been linked to right hemisphere damage 
(Brownell et al. 1986) and frontal lobe damage (Alexander et al. 1989). McDonald and 
Pearce (1996) have found loss of ability to interpret sarcastic remarks in patients with 
frontal lobe and subcortical damage (8 of 10 subjects with bi-lateral damage). Their 
test required the subjects to read a short dialogue and then answer questions about 
the exchange. For example: 

Mark: ‘What a great football game’. 

Wayne: ‘Sorry I made you come’. 

Question: Did Mark think the game was good/bad? 

Seven of the ten subjects were also given an affective prosody recognition task. 
Although some of their subjects were impaired on this task, there was no correlation 
between their ability to detect emotional tone and their ability to recognize sarcasm. 
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These results indicate that the interpretation of sarcasm is not dependent on correctly 
processing the prosodic tone in which the remark is uttered. McDonald and Pearce 
also evaluated their subjects’ performance on the Wisconsin Card Sorting Task. Here 
there was a direct correlation between the results on the WCST and results on the sar¬ 
casm interpretation task. Performance on the WCST is usually disrupted after lesion to 
the dorsolateral frontal cortex, which supports the ability to consciously change one’s 
reactions in the face of changing contextual cues (Petrides 1989). Damage to the dor¬ 
solateral frontal cortex may cause difficulty in switching from a literal to a non-literal 
interpretation of an utterance. McDonald and Pearce’s (1996) results are consistent 
with a proposal by Alexander et al. (1989) that lesions to the RH dorsolateral frontal 
cortex may cause disturbances in the appreciation of sarcasm, irony, and inference. 

7 . metaphors. A metaphor is a descriptive term given to an object or action to which 
it is imaginatively, but not literally, related. As in the case of the appreciation of sar¬ 
casm, the comprehension of metaphors necessitates rejecting a literal interpretation 
in favour of a figurative one. The correct interpretation of a metaphor is derivable 
from the context (intra- or extra-sentential) in which it is produced. For example, 
in the sentence: ‘The lady’s jewels were bursting stars’ the non-literal meaning of 
the metaphor is understood through comparing the properties common to the term 
bursting stars and to the other noun in the sentence, jewels. Of the network of con¬ 
ceptual features mapped to the lexical items bursting stars (i.e., ‘bright’, ‘sparkling’, 
‘gaseous’, ‘enormous’) only those features that are shared with the lexical item jewel 
are understood to be relevant in this context (i.e., ‘bright’ and ‘sparkling’). Explicit 
memory is involved in supporting the appreciation of metaphors, since the ability 
improves with instruction (Paradis class notes 1999). The inferencing process used to 
understand metaphors can therefore be explicitly taught. 

Metaphor interpretation was investigated by Bottini et al. (1995) in an experimen¬ 
tal PET study that involved scanning both hemispheres of students participating in 
a metaphor-judgement task. The students were presented with sentences containing 
metaphors, and were asked to indicate whether they found the metaphor plausible or 
implausible. For example: 

‘The investors were squirrels collecting nuts.’ (Plausible) 

‘The investors were trams.’ (Implausible) 

This task condition was compared to a literal sentence processing condition in which 
students judged the sentence’s plausibility at the literal level (i.e.,‘The boy used stones 
as paperweights’—plausible). The results of the comparison showed activation in the 
RH middle temporal gyrus, the RH precuneus (which is reportedly involved in long 
term memory functions such as the retrieval of words from memory (Bottini et al. 
1995)), the RH anterior cingulate (which appears to play a role in conscious, controlled 
processing (Dehaene et al. 1997)), and the RH dorsolateral frontal cortex, which, as 
mentioned in relation to sarcasm, is involved in processing conditional cues. 
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8. predictive models of rh functions. This paper has dealt exclusively with 
prosody, sarcasm, and metaphor. Other aspects of pragmatics were not discussed 
because of a lack of clinical and experimental evidence regarding their relations to 
explicit/implicit memory. It is nonetheless possible, using the definitions of explicit 
and implicit memory, to predict the roles of these memory systems in sustaining 
pragmatic functions not discussed in this paper. For example, the appreciation and 
expression of affective body language (i.e., gaze and gestures) and of facial expres¬ 
sion is largely automatic/involuntary, which suggests implicit memory involvement 
in their representations. Some of the aprosodics in the case studies reviewed were 
impaired in the interpretation of facial expression (Starkstein et al. 1994), however, 
some were not, indicating that although the interpretation of facial expression and 
the recognition of affective prosody may both be implicit pragmatic functions, they 
are also neurofunctionally dissociable. Lexical-semantic information is mediated to a 
certain extent by explicit memory in the LH, which suggests that RH explicit memory 
substrates (i.e., MTL/TP cortex) are involved in lexical pragmatic functions (jokes, 
irony, figurative meanings). The deliberate inferencing of non-literal meanings from 
context may be partially explicit, as McDonald and Pearce’s (1996) study indicates. 
The RH dorsolateral frontal cortex may also be involved in pragmatic functions that 
rely on conscious lexical inferrencing processes (as seen in Bottini et alls 1995 study 
of metaphor comprehension). 

9. conclusion. The intent of this paper was to a) demonstrate a lack of available 
information regarding the manner in which RH pragmatic function is represented, 
and b) provide arguments in terms of an application of implicit/explicit memory 
theory as to why this dearth should be addressed. In this paper I have argued in 
favour of an RH lateralized dissociation between implicit (e.g., affective prosody) and 
explicit (e.g., metaphor processing) pragmatic function. To the extent that appropri¬ 
ate studies were available for review, the data do support an apparent implicit/exp licit 
memory dissociation of RH pragmatics. However, this hypothesis needs to be supple¬ 
mented by further studies. 
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THE AMPLIFICATION CLAUSE ROOT IN SAN MIGUEL EL GRANDE MIXTEC 
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San Miguel el Grande Mixtec (SMM) is spoken in the western highlands of the 
State of Oaxaca, Mexico. The data are taken from more than fifty-five discourses 
including narrative, procedural, behavioral, and expository types having to do with 
the beginnings of various phenomena, the spirit world, causes and cures of various ill¬ 
nesses, causes of various anomalies of nature, customs, and animal and people stories. 
These were collected some 55 to 65 years ago. Many of these were told and then writ¬ 
ten by one of the leading storytellers of the village, Narciso Merecias. The goal of this 
paper is to identify and uniquely characterize the form and function of an interesting 
grammatical structure in SMM. 

1. overview. Very early in my time with the people of San Miguel el Grande, I heard 
and learned a statement about a donkey hurrying down a trail. It was: 

3 Ndaba -tS cuahdn -fa. 

jumppoT -3AML goPOT -3AML 1 

‘It goes bouncing along.’ 

This seemed to me to describe most accurately what the donkey was doing. Then I 
began to notice more of such constructions that primarily, but not exclusively, use 
verbs of directional motion. Also this P (simple predicate) is followed by a S (subject) 
which is coreferential with the S following the second P. The two Ps are never the 
same, but each amplifies the meaning of the other, and I never heard a pause between 
the two PSs. Because of these factors, I have judged the combination to be a special 
sequence, which occurs in the same place in the nucleus of a clause that a simple PS 
group does. So I have labeled it Amplification Clause Root (AmpliCIRt). This struc¬ 
ture may have up to three PS groups. 

A second factor supporting the unity of the AmpliCIRt has come to attention in the 
last several years. Some of the longer texts that were first written by Narciso Merecias 
were recently edited by the very knowledgeable Angel Merecias Sanchez by adding the 
very necessary symbols for tone, glottal stop, length of vowel, and nasalization. The edit¬ 
ing also adds commas indicating phonological breaks, but no periods, which would 
indicate sentence breaks. Importantly for my analysis, there are no commas between 
the PSs of what I’ve identified as the AmpliCIRt. These data give evidence that mother- 
tongue speakers recognize the phonological unity of that structure. 
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Section 2 presents data of clauses (CIRt +/- CIMargins of time, location, manner, 
etc.) in which are the more frequently occurring clause roots (P + S + any obligatory 
or optional argument). Only with this information is there a basis to judge that the 
amplification structure does occur where other clause roots occur. 

Section 3 presents three levels of sentence-independency in relation to Angels 
commas. These sentence features need to be known in order to determine whether 
the amplification structure, without commas, best fits in the nucleus of a sentence 
(normally filled by a clause) rather than in the nucleus of a clause. The data support 
the latter. 

Section 4 presents examples of the content and distribution of the 76 AmpliCl- 
Roots found in the corpus of more than fifty-five discourses. It shows that, indeed, the 
AmpliCIRt occurs in the nucleus of the clause where other clause roots occur. 

Section 5 presents examples that closely resemble the AmpliCIRt, but are shown to 
be contrastive with it. 

Section 6 presents the distribution of aspect in each VP of the AmpliCIRt. How¬ 
ever, such features have yet to be related to aspect and time relations of the larger 
context. 

Section 7 summarizes some of the interesting features of the AmpliCIRt. 

2. more frequent clause structure. In most of the nuclear clause structures the 
order of constituents is rigidly VSO. In this paper the V will be identified as P (simple 
predicate). P, as a slot, is filled by the class Verb Phrase (VP) the minimum of which 
is a single verb. The verb consists of a single verb root or up to a complex of verb 
roots, augmentable by several prefixes. The VP optionally includes some limiting par¬ 
ticles and adverbial phrases following the verb, and is optionally preceded by one of 
two aspect proclitics, which in turn may be preceded by various verb modifiers. In 
parentheses, following the English translation of the examples, is the number of the 
discourse and place in the discourse from which the example is taken. For a statement 
of grammatical levels of structure used here, see Pike and Pike (1982 and 1983). 

Single VRt (bold face) is a minimal VP followed by S (with a hyphen in the Mixtec 
indicating a border between morphemes within a word, and a dot between English 
words indicating the meaning of a single Mixtec morpheme): 

(1) ... te quee nduu -ni mad* -yd*... 

and eatPOT two -just just.that.one -12 

.. just the two of us will eat. .(25.5) 

Example (2) begins with a frequently occurring sentence connector (SC), and then is 
followed by an independent clause which begins with a VP (bold face) in which the 
nucleus is a close-knit verb stem ‘be.happy’ followed by S, and then a clause margin 
with the role of reason. The part which follows jd* is a transitive clause root with its 
VP in bold face : 
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(2) 1 Te yuan te ni ca-cu-saS * xdan*ini -de ji* -nahan 

and then and cmp PL-be-happy very stomach -3FM with -group 
-de, jd* ni ca-nihin* xaan* -de nuju, jnn* parque. 

-3FM that cmp PL-obtain very -3MF weapon with ammunition 
‘Then they all were very happy, that they had obtained many weapons and 
ammunition.’ (54.589-91) 

In (3) the VP is followed by S {-ha* is an enclitic variant of naha ‘woman’ and -un is an 
enclitic variant of yuan ‘that.one’) and Argument (slot) as Undergoer (role), (Arg-U): 

(3) ...te ni quihin -ha* chuun -ha* -un... 

...and cmp take -3FF chicken -3FF -that.one 

.. she caught that specific chicken of hers...’ (25.7) 

Example (4) shows a more complex VP: 

(4) 1 Yuan-na te ni ca -na -caca yata -de ji* -nahan -de. 
then-just and cmp pl -rep -walk backwards -3FM with -group -3FM 
‘Then all these retreated again.’ (54.450) 

In the following examples the nucleus of the clause is in bold face. In addition to 
the Arg-U, Mixtec can also include an Arg-Sc (Scope) which is the direction of the 
action. Following the clause nucleus (a slot), optionally, are clause margins (slots) with 
roles such as time, manner, purpose, location, etc. Slot and role are two of the four 
features of the tagmeme, in our model of Tagmemics. The other two features are the 
filler class (VRt, VP, NP, LocPh, TCIRt, etc.) and cohesion (agreement constraints). Cf. 
Pike and Pike (1982 and 1983). 

Example (5) shows CIRt: P S U + CIMar-Purpose: 

(5) suan x* -ndoo -de aan vansada, * ca -i 

thus cs -remaincNT/poT -3FM one outpost cnt pl -? 

-ndito ichi* Carranza 

-take.care.of road Carranza 

‘.. .in this way they were placing personnel at an outpost in order to throw 
back the Carrancistas.’ (54.455-56) 

In example (6) SC is followed by CIRt (bold face): P S U Sc + CIMar-Loc: 

(6) 'Juan te ni nda -nihin -de batea yuan saquS* -de yuan. 
then and CMP raise-obtain-3FM trough there on.top.of -3FM there 
‘Then, there, he lifted up that trough on top of himself.’ (54.696-97) 
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Examples (1)—(6) present the structure of the most frequently occurring clauses 
in SMM, showing the sequence of P, S, Arg, and CIMargins. If the PSs of the ampli¬ 
fication structure can each have their own arguments and clause margins without a 
comma before a following P, then, probably considering each a separate clause would 
be the better analysis. 

3. independency of sentence. In SMM the S, any Arg, or any CIMar maybe fronted, 
in which case these fronted elements are sentence margins (SentMar). The fronted unit 
is in special prominence according to discourse features. Other sentence constituents 
are sentence connectors (SC) which precede the fronted elements. Depending on the 
occurrence of these sentence constitutents, we propose at least three levels of sentence 
independency within a discourse. The most independent sentences are those intro¬ 
duced by fronted units or by SCs followed by fronted units. Such sentences are indi¬ 
cated by a superscripted 1 ('). The next level of independency are sentences introduced 
by SCs, but no fronted units. Such sentences are indicated by a superscripted 2 ( 2 ). The 
least independent sentences are those with no SCs and no fronted unit, hence they 
begin with the Predicate. They are indicated by a superscripted 3 ( 3 ). These latter are, 
with very rare exceptions, a re-statement of what has already been said. 

This information is relevant to determine if the amplification structure not only 
has the internal structure of a clause, but also the distribution within a sentence that 
a clause has. 

The sentence analysis above has been done with the data of Text 54, which is 
judged to have more than 500 sentences. This is a narrative which was first written by 
Narciso Meredas. It tells of the involvement of the men of San Miguel el Grande with 
the government army of the State of Oaxaca fighting the revolutionaries Venustiano 
Carranza, Francisco Villa, and Emiliano Zapata, before the actual commencement of 
the Mexican Revolution in 1910. This is one of the texts which was edited by Angel 
Meredas Sanchez. In his editing he indicated phonological breaks with commas, but 
no periods at all to indicate grammatical sentences. That problem, then, gave me the 
stimulus to examine closely what I consider to be sentence features of SMM. Every 
period that I’ve introduced in the SMM text replaces a comma. All other commas are 
as Angel wrote them, and which I judge not to indicate sentence breaks. The data here 
show that the amplification sequence fits better as a clause root in the nucleus of the 
clause rather than as a clause in the nucleus of a sentence. 

In (7) the P and S of the clause root are in bold face. Note the SC te plus the 
fronting of S, which is a NP coreferenced in the normal position by the pronoun -de 
(enclitic variant). Two LocPhs (‘at the designated place’ and ‘at the outpost’, each pre¬ 
ceded by a comma) follow: 

(7) 'Te sava -ga fuerza ni ca -s* -quenddo -de, nu* m 
and some/half -again force cmp pl -cs -remain -3FM face cmp 
tetahdn -de, nu* coo vansadd. 

assign.work -3FM face becNT/poT outpost 
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(7) ‘The rest of the forces he required to stay put at the designated place, at 
the outpost.’ (54.438-40) 

In (8) the SC nusaa te and fronted time phrase vita nuhni are followed by a clause root: 

(8) 1 Nusda te vita nuhni nd* cuaha -ri nombramiento -de. 

then and now immediately insis givepoT -11 assignment -3FM 
‘Then right now, I’ll give him his commission.’ (54.103) 

In (9) note the SC te preceding the fronted TmCl and the te that introduces the clause 
nucleus (bold face). Also note in the TmCl that modifies ‘paper’ is an independent 
clause, which in a different environment would be a type 3 sentence, as also is the 
modification of Mr. Soria translated as ‘who would come with it’: 

(9) 1 Te nu* ni jlnu tutu yuanni saha-de, te ni 

and when cmp finish paper there cmp do -3FM and cmp 
jaha-de nuu* suu chaa Soria vai -de jim*. 

give -3FM face def.item man Soria comepoT-3FM with 

When he finished that paper which he had done, he gave it to Mr. Soria 
who would come with it.’ (54.110-11) 

In (10) the SC te appears with no fronted element: 

(10) 2 Te ni jad -de yuan, nuu* chaa * cuu Gobernador. 

and cmp arrive-3FM there face man cnt be governor 

‘He arrived there, in the presence of the man who was the Governor.’ 
(54.16-17) 

In (11) the least independent sentence, beginning with P, summarizes the preceding 
sentences: 

(11) 3 Vdi -de naa tdca calle yuan. 

comepoT-3FM the.whole all street there 
‘He was coming through all those streets. (54.701) 

4. amplification clause root. The PS groups in the amplification structure of 
SMM are a feature of the Otomanguean language family. In languages such as Trique, 
Chatino, Otomi, and other Mixtec languages these structures are described as occur¬ 
ring at a higher level rather than in the nucleus of the clause (Pride 1965 and Alexan¬ 
der 1988). Longacre 1966 describes them as constituents of the sentence. 

In the examples presented so far are given the most frequent PS sequences in the 
nucleus of the clause, along with some other features of the SMM sentence. The data 
are taken from more than 1,500 sentences in texts. Almost all sentences have in their 
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nucleus an independent clause. There are also many independent clauses which are 
modifiers, particularly in a NP (cf. (9)) or introduced by a dependent marker (cf. (2)). 
All these are sources from which at least 76 examples of the amplification structure 
have been collected. The most frequent verb roots involved are directional motion 
ones. There are 52 examples (68%) in which all the verbs are of this type. The VRt quee 
‘leave’ occurs 23 times (30%), but only once as the second P of the group. As the first 
P, it occurs 11 times (14%) with cuahan ‘go’ as the second P. This is the most fre¬ 
quently occurring pair of verbs. There are 16 (21%) examples in which there are no 
directional motion verbs. There are 10 (13%) examples in which one verb is a direc¬ 
tional motion verb. The nondirectional motion verbs have the translations of do, say, 
sleep, fear, think, sing, yell, register (as for a marriage license), carry.by.hand, remain, 
win, throw.out, be.sweet, be.responsible.for, be.situated.on, be.carrying, be.hidden, 
be.located. The Ps of the AmpliCIRt, in this section, are in bold face. These data con¬ 
tinue to support the claim that the amplification sequence is a clause root, especially 
since no comma nor Arg nor CIMar occurs between the PSs. 

In (12) three PSs include quee and cuahan-. 

(12) 1 Yuan -na te ni quee maa* chaa-un cuahan-de 

then -just and cmp leave that.one man-that goPOT -3FM 

cud -nohon -de vehe -de. 

goPOT -go.home -3FM house -3FM 

‘Then that man went away going home to his house.’ (21.9) 

In (13) three PSs appear with only two directional motion verbs: 

(13) 2 Te ni quihin-ni * candahd -ni vdi -ni jnn *. 
and cmp take -2F cnt carry.by.hand-2 f comepoT-2F with 
‘You took it and came carrying it by hand.’ (54.242) 

In (14) two PSs appear with the most frequently occurring verb pairs quee followed 
by cuahan varying to the allomorph jahan. Arg-Sc follows: 

(14) 1 Chda yuan ni quee-de ni jahan-de, huu Nundud... 

man there cmp leave -3FM cmp go -3FM town Oaxaca.City... 

That man, he left, going to Oaxaca City ...’ (54.5) 

In (15) the two Ps are not verbs of directional motion. Note the order of the Ps: drink 
is first, then eat. Then note the order of the Us (not according to the order of their 
verbs): food, coffee, bread. Only coffee, which is in the middle, is what is drunk; the 
other two are what are eaten. If the PSs were not a unit, but rather a sequence of two 
clauses then coffee would have occurred following the verb drink. Also the feminine 
form is used as subject even though men are also eating and drinking. This is because 
women are responsible for providing the food: 
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(15) 2 Te ni ca-jihi ndenuni*-ha ni cd-yee*-ha ndeyu jiin* 

and cmp PL-drink the.three -3FF cmp PL-eat -3FF food with 

cdjuee jiin* statild. 

coffee with bread 

‘And the three couples ate the food, coffee, and bread.’ (21.55) 

In (16) two other directional motion verbs are followed by a time-location phrase. Cf. 

(11): 

(16) 3 M ndava -de vai -de, onde nu.* ni ndu -haa. 

cmp to.enter.again -3FM comepoT-3FM until face cmp become-darkness 
‘He re-entered (the streets) coming to where it was dark.’ (54.704-053) 

Example (17) shows different set of verbs with Sc: 

(17) 2 Te ni quee-de ni ja -cachi-de nuu* lamu. 

and cmp leave-3MF cmp arrive-say -3MF face owner 

‘He went and told the owner.’ (16.25) 

For (18) note that the two Ps (verbs of directional motion) are quee ‘leave’ and nohon 
‘go.home’. The first S is expressed by a long NP including both a woman and two 
males, and the second S, coreferential with the first S, uses the feminine pronoun 
even though two men are involved. Note the plural prefix in the second verb. Once 
again -ha* occurs because it’s the woman’s responsibility for getting the family home. 
Arg-Sc follows the PSs: 

(18) 1 Yuan-na te ni quee hahan yuan jiin* chda yuan jn* se 

then -just and cmp leave woman that with man that with child 

-yu- ha* ca-nohon -ha* vehe -ha* 

-male-3FF pl -go.home -3FF house-3FF 

‘Then that woman, that man, and their son went back home to their 
house.’ (21.63) 

5. counterexamples. There are complex VPs which might at first glance seem to be 
AmpliCIRts, but are separated by a comma, or have two Ps but just one S, or have the 
same P. 

In (19) a comma separates the two PSs. 

(19) 1 Yuan te ni jdcunchaa -de, ni cava -de hayava*. 

then and cmp begin -3FM cmp twist-3FM person 

‘Then he began to gather the people together.’ (54.405-06) 
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A comma separates two PSs in (20): I analyze it as two sentences because the second 
sentence reiterates what has already been said. I have replaced the comma with a 
period: 

(20) 2 Te nl cd-cu -tutu* -i ji* -nahan-i. 3 Nl ca-hlin -i. 
and cmp pl -be-assemble-31 with-group-31 cmp PL-be.located-31 
‘They all came together. They all were there.’ (54.407-08) 

Example (21) is a complex VP followed by one S: 

(21) 2 Te nl ndaha nl ca-saha tuha -de maa* -de ji* 

and cmp to.end cmp PL-do prepare-3FM just.that.one-3FM with 
nahan-de. 

group -3FM 

‘They had finished their precautionary preparations.’ (54.613) 

Example (22) is another complex VP in which the two verb roots are the same, each 
with a different modifier. These units are followed by ini which is obligatory to both 
verb roots to communicate ‘to.decide’ rather than ‘to.remove’, which it would mean 
without ini. There is only one S: 

(22) ...ni tava* ndaa nl tava* cutta* ini -nd... 

...cmp decide true cmp decide absolute stomach -if 

‘... I have decided truly and absolutely ...’ (21.28) 

Example (23) is more ambiguous as to whether what I’ve considered to be in the 
nucleus of the clause is rather in the nucleus of the sentence, as some have claimed for 
other Otomangean languages. In that case yuan would be a clause margin of location 
and the following PS would be the second PS of an Amplification Clause . However, I 
judge that the second PS is modifying yuan to form a LocP and not a second PS of an 
amplification structure. This results in having just one PS in the clause nucleus: 

(23) 3 * ca-i -ndatu-de yuan * ca-hlin-de. 
cnt pl-?- wait -3FM there cnt PL-be? -3FM 
‘They were all waiting there where they were.’ (54.419) 

6. aspect sequences. The combination of aspects in the AmpliCIRt is complex. Here 
will be presented some of the data regarding the aspect combinations in the two SP 
sequences. 

Aspect is indicated in the VP. Time within a discourse is indicated by clause mar¬ 
gins of time or by time units at higher grammatical levels. There are three aspects in 
SMM: 1) potential (pot), 2) completive (cmp), and 3) continuative (cnt). pot is the 
basic form of the verb root. Each aspect may occur with any time unit, pot occurs 
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most frequently with a future time, cmp is indicated by the morpheme nt (varying to 
nf) immediately preceding the verb in the VP. It occurs most frequently with a past 
time unit, cnt is indicated, in the same position as ni, by the continuative aspect 
morpheme which, in smm, has no consonants nor vowels (but does in some other 
Mixtec languages). It has only the tone-changing features of an arbitrary set of mor¬ 
phemes (B class) that changes the tone sequence of the following morpheme, if that 
sequence can be changed. That continuative morpheme is indicated by an asterisk 
on the line (*). Other morphemes in that arbitrary B class are marked by a raised 
asterisk following the morpheme (morpheme*). The aspect of a VP is ambiguous 
(cnt/pot) when the verb has a changed tone sequence (but with no variant having 
segmental phoneme changes) and is preceded by a B class morpheme (with conso¬ 
nants and/or vowels), cnt occurs most frequently with a current time indicator. Cf. 
Pike (1948:77-94) for details of the morphosyntactic tone rules of SMM. 

The occurrence of aspect and time units in a text are very important in under¬ 
standing the structure and the meaning of that text. We begin here by examining 
the distribution of aspect in the 72 examples of the AmpliCIRt taken from the entire 
corpus. This is seen in the first column of the Table 1 (overleaf). In the second column 
are the distribution features of aspect in the 36 examples of the AmpliCIRt taken from 
Text 54. The reason for this contrast is that Text 54 is a narrative and its basic aspect is 
CMP. I haven’t studied in any detail the genres and basic aspect of most of the other 
discourses; it is a heterogeneous group. However, Text 54 is a narrative, hence more 
homogeneous overall, although embedded in it are sections of other types. 

The highest frequency of the nine possible combinations of aspect are cmp + pot: 
21% for the total corpus and 31% for Text 54. This higher percentage for Text 54 might 
possibly have some relation to its basic cmp aspect. The percentage for cmp in one or 
both PSs is 58% for the total corpus, and 64% for Text 54. Such a difference isn’t as 
great as I would have expected. 

In Text 54 there is a very significant diatribe of 39 sentences. Most of the sentences 
are short and include a considerable number of accusing questions. There are three 
examples of the AmpliCIRt (not in the questions) in which both parts are in the pot 
aspect. A fourth AmpliCIRt, which speaks of what has already been said and has three 
PSs, has the three aspects: cmp + cnt + pot (cf. (13)). It may very well be that pot 
aspect in SMM is especially characteristic of a diatribe. 

Much more study is needed to find real relevancy in the distribution of aspect in 
the AmpliCIRt. This would include studying aspect and time units in the larger con¬ 
texts of the AmpliCIRt. 

7. summary. AmpliCIRt: (a) a grammatical unit consisting of 2 or 3 simple predicate- 
subject sequences which fills the nucleus of a clause, hence is a clause root; (b) a 
phonological unit; (c) uses largely directional motion verbs; (d) contrasts with the 
multiple verb complexes in a VP; (e) presents the need to further study the relevancy 
of aspect in its VP. 
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Data from 72 examples 

from total corpus 

Data from 36 examples from 

Text 54 

Same aspect in each PS: 

27 

38% 

Same aspect in each PS: 

16 

44% 

cmp + cmp: 

13 

18% 

CMP + CMP 

8 

22% 

pot + pot: 

6 

8% 

POT + POT 

7 

19% 

CNT + CNT: 

4 

6% 

CNT + CNT 

1 

3% 

Different aspects in each 

PS: 


Different aspects in each PS: 



CMP + POT 

15 

21% 

CMP + POT 

11 

31% 

POT + CMP 

1 

1% 

POT + CMP 

0 


CMP + CNT 

11 

15% 

CMP + CNT 

4 

11% 

CNT + CMP 

2 

3% 

CNT + CMP 

0 


POT + CNT 

0 (one ambiguity) 

POT + CNT 

0 


CNT + POT 

4 

6% 

CNT + POT 

2 

6 % 

cmp in one or both PSs 

42 

58% 

cmp in one or both PSs 


64% 


Table 1. Combination of aspect in the two PS sequence. 

The AmpliCIRt communicates a vivid picture of the speaker’s understanding of 
a complex happening reflected in an infrequently used grammatical structure, link¬ 
ing it directly to a somewhat more simple happening with its more frequently used 
structure, since the two fill the same slot. Once again I mention one of the very first 
instances of this structure that claimed my attention. It was what was said to me about 
the donkey that was hurrying down the trail in front of me: 

3 Ndaba -fa cuahan -tS. 
jumpPOT-3AML gOPOT -3AML 
‘It went bouncing along.’ 

This is a very much more visually accurate picture of that donkey than a more typical 
construction could give it. 

This construction, from my limited experience, is probably quite unique among 
languages in general, hence merits some attention. It differs from the serial verb con¬ 
structions that I have worked with in languages of Nepal and Papua New Guinea. 
Much more investigation is needed to appreciate in greater detail its structure, func¬ 
tions, distributions, and semantic environments. 


Some abbreviations used in this paper: if = 1st formal, 11 = 1st informal, 12 = you & 1 ,2F = 
2nd formal, 21 = 2nd informal, 3FM = 3rd formal male, 3FF = 3rd formal female, 31 = 3rd 
informal, 3AML = 3rd animal, ' = high tone, " = low tone, pl = plural, pot = potential, 
cmp = completive, cnt = continuative, rep = repetitive, cs = causative, insis = insistent, 
sc = sentence connector, CIRt = clause root, U = undergoer of action. Sc = scope, direc¬ 
tion of action, ’•'(asterisk) = morpheme so marked is class B, ^(asterisk on line) = cnt 
morpheme B class which has no form other than triggering a tone shift, - (hyphen) = a 
border between morphemes within a word, a dot between English words = the meaning 
of a single Mixtec morpheme. 
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The number preceding the period is the number of the discourse in my corpus from 
which the example is taken. The number following the period is the unit within that dis¬ 
course. 
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introduction. Knowing whether and how speech perception and production are 
related has important implications for research involving, among many other things, 
first language acquisition (e.g., Vihman 1991), language impairment (e.g., Tallal & 
Stark 1976), artificial speech recognition (e.g.. Rose et al. 1996), as well as linguistic 
and psychological theories (e.g, Labov et al. 1991). Such a link is not necessarily appar¬ 
ent, considering that the two skills—one involving motor control and the other audi¬ 
tory processing—may or may not be controlled by different mechanisms (e.g., Allport 
et al. 1987). However, support for the interdependence of perception and production 
is more positive than negative (e.g., Fox 1982), although the nature of such interde¬ 
pendence is still unknown and heavily debated (Lindblom 1996, Fowler 1996). 

One area of research where the relationship between perception and production 
is less transparent is adult second-language acquisition, which, in comparison to rela¬ 
tively uniform language development in children, is variable along many dimensions 
(e.g., Mack 1989, Flege et al. 1999). Among the most salient are such individual dif¬ 
ferences as amount of second-language experience and age at the time of second- 
language learning. However, by carefully controlling these individual differences, it 
is possible to gain insights into speech processing in general, and second-language 
speech perception and production in particular. Understanding such a relationship 
is important for both theoretical and pedagogical reasons. Theoretically, knowing 
whether perception does in fact influence production abilities may illuminate how 
second languages are acquired (Rochet 1995). For pedagogical reasons, such an under¬ 
standing may help determine what types of second-language training will be most 
effective, and how foreign accents can be minimized (Bradlow et al. 1999). 

In general, three hypotheses about the relationship between speech perception and 
production have been advanced. In both first- and second-language studies, the most 
widely supported hypothesis is that accurate perception is at least one necessary com¬ 
ponent of accurate production (Flege 1995), which translates into perception abilities 
usually surpassing, and therefore preceding, production abilities, especially for begin¬ 
ning second-language learners. Even advanced language learners, whose perception 
and production abilities are nearly asymptotic, perceive some vowels more accurately 
than they produce them (Flege et al. 1999). 

The results of second-language acquisition studies, however, are not conclusive. 
Other researchers working within the same framework have proposed that pro¬ 
duction and perception are interdependent and develop simultaneously (Best 1995, 
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Fowler 1996). Assuming that perception entails the ability to detect distal articula¬ 
tory properties of speech (i.e., tongue movement, vocal tract size, etc.), this hypothesis 
maintains that perception and production are always aligned, so that perception never 
surpasses, and therefore never precedes, production and vice versa. The McGurk 
effect (McGurk & MacDonald 1978), the phenomenon that listeners use both visual 
and auditory information to recover speech, and the results of imitation studies 
(Ryalls & Pisoni 1997), which illustrate that children rely on imitation when learning 
to speak, support the hypothesis that perception and production abilities are interde¬ 
pendent. Speech processing models based on this hypothesis predict that speech per¬ 
ception and production abilities in second-language learners develop in synchrony 
and depend on how difficult it is to recover slight articulatory differences between 
first- and second-language sounds (Best 1995). In support of this hypothesis, for 
example, Flege et al. (1997) demonstrated that, at least for some bilinguals, perception 
and production abilities are aligned. 

In contrast, yet another hypothesis states that accurate production actually pre¬ 
cedes accurate perception, so that some speech contrasts are maintained in produc¬ 
tion before they are actually perceived. Many first-language studies of dialect change, 
for example, have proposed that changes first occur in production before they are sus¬ 
tained in perception (Labov et al. 1991; see, however, Bowie 2001). Second-language 
studies often demonstrate that at least some learners are able to produce differences 
between non-native sounds that they cannot perceive (Flege et al. 1997), such as Japa- 
nese-English bilinguals who can produce a contrast between /r/ and III without being 
able to perceive it (Sheldon & Strange 1982). These findings suggest that there is at 
best only a weak relationship between perception and production, and that the two 
develop independently. 

Conflicting results such as these suggest that the perception/production relation¬ 
ship may change over time. If perception and production abilities are completely 
aligned initially, the first hypothesis proposes that, as the learner gains experience, 
perception precedes production. Conversely, the second hypothesis would predict 
that both perception and production would be aligned regardless of language expe¬ 
rience. Finally, the third hypothesis would predict that production may in fact out¬ 
strip perception as a second language is learned. Therefore, observing the development 
of perception and production abilities maybe essential in differentiating among these 
three hypotheses. Thus, the primary objectives of this study were (1) to determine 
which of the three hypotheses most accurately describes the development of percep¬ 
tion and production abilities in second-language learners, and (2) to explore what 
other factors may influence or explain the perception-production link. 

1. relationship between perception and production. To explore the develop¬ 
ment of second-language speech perception and production abilities, in this study 
Korean-English bilinguals performed perception and production tasks of English 
vowels. To specifically understand how the relationship between the two skills devel¬ 
ops, the results from these experiments were correlated and examined on an indi- 
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AOA 

LOE 

Chron. Age 

Use of Korean 

Late+3 
Late+10 
Early+10 

K. Mono 

E. Mono 

23.96 (3.69) 
21.35 (2-82) 
8.98 (1.18) 
29.02 (2.68) 

3.04 (0.40) 
9.82 (2.26) 
11.05 (1-88) 
0.23 (0.10) 

27.28 (3.52) 
32.08 (2.67) 
20.33 (1-63) 
29.25 (2.69) 
25.60 (4.53) 

63.00 (20.03) 
44.00 (22.21) 
19.00 (20.25) 
74.00 (22.21) 


Table 1.Means of age of arrival (AOA), length of experience (LOE), chronological age 
(Chron. Age), and percentage of use of Korean (Use of Korean). Standard deviations 
appear in parentheses. 


lil 

lil 

Ini 

M 

/as/ 

/£/ 

beat 

bit 

boot 

book 

bat 

bet 

bead 

bid 

booed 

good 

bad 

bed 

heed 

hid 

who’d 

hood 

had 

head 


Table 2. Vowel stimuli used in the study. 

vidual basis. In Part 1, the influence of age at the time of learning (age) and amount 
of exposure (experience), as indicators of linguistic experience, on the perception/ 
production relationship was evaluated to understand the developmental component 
of speech processing. In Part 2, the data were re-analyzed along factors other than age 
and amount of exposure to gain better insights into the role of individual differences 
in the perception/production relationship. 

1.1 subjects. Thirty adult Korean-English bilinguals participated in the study. Bilin¬ 
guals differed in age of first exposure to English, defined as age at the time of arrival 
in the United States (early vs. late), and the amount of linguistic experience, defined 
as length of stay in the United States (3 vs. 10 years). In addition, 10 English mono- 
linguals and 10 Korean monolinguals participated as comparison groups for a total 
of 50 participants altogether. Korean monolinguals had resided in the United States 
for an average of 3 months and were beginning learners of English. Based on age of 
arrival and length of residence, the bilinguals were assigned to three groups, with 10 
subjects in each group. Table 1 presents demographic variables for the groups used in 
the study. 

1.2. stimuli. Most previous studies which explore the relationship between second- 
language perception and production examine difficult consonant contrasts (Sheldon 
& Strange 1982, Bradlow et al 1999). In order to extend the results of these previous 
studies to vowel contrasts, we chose three English vowel pairs which are confusable 
and particularly difficult for Korean-English bilinguals to both perceive and produce 
(Flege et al. 1997): lil-lil, /u/-/u/, and /ae/-/e/ placed in the phonetic environments 
bVt/k, b/gVd, and hVd (see Table 2). 
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The stimuli used in experiment 1, the speech perception experiment, were pro¬ 
duced by three native English speakers and recorded using a Shure unidimensional 
head-mounted microphone and Sony tcd-d8 dat recorder. All speakers were English 
monolinguals from Illinois and had had no or minimal exposure to a foreign lan¬ 
guage. Recordings were digitized at 22,050 Hz, normalized for peak intensity, and 
ramped off during the first and last 15 msec, to prevent audible clicks. Recorded stim¬ 
uli were not edited in any other way and were representative of the variability in 
fundamental frequency, vowel duration, stop-consonant closure duration, and voice- 
onset time which are found in natural speech. Prior to the experiment, 10 monolin¬ 
gual speakers of English identified the stimuli with a 98% accuracy in a fixed-choice 
identification task. 

1.3. procedure. In order to obtain a measure of the bilinguals’ perceptual abilities, 
experiment 1 used a fixed-choice identification task. In this task, subjects heard an 
English word presented over the headphones and matched it, on the computer screen, 
with one of six response alternatives labeled orthographically as English words. The 
stimuli on each trial were played individually, with each new trial presented 1 sec. 
following the subject’s response to a previous trial. No feedback and no practice ses¬ 
sion were given. However, subjects could listen to each token as many times as they 
wished. Six test stimuli in hVd context recorded by three speakers were presented in 
three randomized lists for a total of 54 one-stimulus trials per subject. The perception 
score (or percent correct in identification) obtained in this experiment was used as a 
measure of bilinguals’ perception abilities. 

To explore the relationship between the bilinguals’ ability to perceive and produce 
the same sounds, in experiment 2 the same bilinguals performed a word production 
task. In this task, bilinguals used the word stimuli from experiment 1 to name black- 
and-white line drawings. For example, the word bat was used to name the drawing 
depicting a baseball bat, whereas the word booed was used to name the drawing depict¬ 
ing sports fans at a stadium. While some drawings did not unambiguously depict the 
objects to be named, the subjects were familiarized with the intended words during 
the study phase. During the study phase, bilinguals twice viewed each drawing and 
repeated the intended word recorded by a female native English speaker. The intended 
word was presented first in a sentence, which provided the necessary context, and then 
in citation form. In the test phase, which contained two blocks of 18 randomized trials, 
subjects attempted to spontaneously name the drawings. When no response was given, 
the expected name was played over a loudspeaker and the subject repeated it. Overall, 
each subject produced 18 words for a total of 720 words altogether. Only spontaneous 
productions from the test phase were used in subsequent analyses. 

Tape recordings of the production data were further used to obtain word produc¬ 
tion ratings. Digitized at 22,050 Hz, normalized for peak intensity, and ramped off 
during the first and last 15 msec, to prevent audible clicks, the 720 word tokens were 
randomized and recorded onto a high-quality audio tape, with each stimulus presented 
with a 4-second interval. To obtain word production ratings, this audio tape was played 
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E.Mono. (1) 

Early+10 (2) 

Late+10 (3) 


mean s.d. 

mean s.d. 

mean 

s.d. 

Perc. 

94.26 4.57 

95-19 

5.60 

59.81 

10.69 

Prod. 

96.39 1.98 

91.89 

7.20 

64.00 

10.65 



Late+3 (4) 

K.Mono. (5) 

ANOVA 


mean 

s.d. 

mean 

s.d. 

F (4.45) Tukey HSD 

Perc. 

62.96 

18.52 

50.37 

13-46 

31.30**1,2 > 3,4,5 

Prod. 

57-67 

14.46 

50.39 

9-53 

46.03**1,2 > 3 > 5 


Table 3. Percentage of vowels perceived and produced correctly by the subjects in the 
five groups. ‘HSD’ indicates the results of a Tukey HSD test (alpha = 0.05; **p < 0.001) 
comparing the performance of the five groups on perception and production tasks when 
the group factor reached significance. 

to a group of 10 English monolingual listeners for identification. In this task, the Eng¬ 
lish monolingual listeners were instructed to choose, on an answer sheet, one of four 
response alternatives. For example, for the word token ‘bit’, the response alternatives 
were: ‘bit’, ‘beat’, ‘bet’, or ‘bid’. The production ratings (or percent correct in production) 
obtained in this task were used as a measure of bilinguals’ production abilities. 

1.4. results and discussion. Calculated across vowels and phonetic environments, 
perception and production data were submitted to two separate one-way analyses 
of variance (ANOVAs), using age at the time of learning (AOA) and length of expe¬ 
rience (LOE) as the grouping variable. Because the group factor (AOA and LOE) 
reached significance for percent correct in perception [f(4,45)=3i.3o, p=o.oo] and 
production [^(4,45)^46.03, p=o.oo], Tukey HSD post hoc tests were used to pursue 
between-group differences (see Table 3). Interestingly, AOA, not LOE, predicted 
the bilinguals’ perception and production performance, therefore suggesting that 
only early bilinguals are able to both produce and perceive vowels similarly to Eng¬ 
lish monolinguals. Similarly, both groups of late bilinguals performed identically to 
Korean monolinguals in perception and production. 

To test the claim that perception and production abilities may differ for bilinguals 
assigned to the same age and experience group (within-group differences), separate 
t-tests were performed within each group on perception and production data. Surpris¬ 
ingly, however, perception and production abilities were identical within each group 
(p>0.05), thus indicating that all groups both perceived and produced sounds with the 
same accuracy. These results maybe taken to indicate that perception does not precede 
production, and that these two perceptuomotor skills develop simultaneously. 

To further explore the relationship among the variables, bilinguals’ production 
and perception scores were correlated with demographic variables. Shown in Table 4 
(overleaf), the results attest to a strong relationship among the variables. Perception 
and production data do strongly correlate positively at the 0.71 level. Furthermore, 
although both correlations are strong and significant, age at the time of learning 
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Production 

Perception 

Perception 

Age or Arrival 

Use of Korean 

Length of Experience 

0.71 ** 
-0.82 ** 
-0.60 ** 
0.57 ** 

-0.69 ** 

-0.53 ** 

0.41 


Table 4. Correlations between perception, production scores and subject variables 
(alpha= 0.05; **p < 0.001J. 

correlates more strongly with production than with perception accuracy. This sug¬ 
gests that perception is perhaps less susceptible to the influence of age and experi¬ 
ence than is production. Assuming that perception and production abilities, at least 
in some measure, develop as a function of age, this result may also indirectly indi¬ 
cate that perception does, in fact, precede production. Speculative and perhaps pre¬ 
mature, such a conclusion, however, merely confirms that perception and production 
are related; it does not indicate whether one ability precedes the other. 

Indeed, the obtained group data and correlations do not provide accurate insights 
into how perception and production abilities are related, nor how such a relationship 
develops as a language is learned. Presumably, our failure to detect oft-cited within- 
group differences between perception and production in bilinguals (Sheldon & 
Strange 1982, Flege et al. 1999) was an artifact of the grouping variables used in the 
analyses—age and amount of experience. Therefore, to specifically address this issue, 
we performed an individual analysis of the data, paying close attention to intra-sub¬ 
ject differences in perception and production accuracy. 

In close agreement with perceptually based accounts of the development of bilin¬ 
gual perception and production abilities (e.g., Flege 1995, Rochet 1995), we hypoth¬ 
esized that good production would be associated with good perception, and poor 
production with poor perception. For intermediate learners, such a hypothesis would 
entail a possible misalignment of both skills (a characteristic of an intermediate 
learning stage) if accurate perception is indeed an important factor in the develop¬ 
ment of accurate production. 

Based on the bilinguals’ production abilities, instead of age or experience, data 
re-analysis yielded 3 groups of bilinguals: those with good (85-100%), intermediate 
(70-85%), and poor (less than 70%) production accuracy. In addition, perception 
and production scores obtained for Korean and English monolinguals were used for 
comparison purposes. Perception and production accuracy, calculated across vowels 
and phonetic environments, was examined in two one-way ANOVAs to determine 
whether the resulting groups indeed differed in these two skills. Presented in Table 5, 
results included a significant main effect of group for both perception [E(4,45^=23.34, 
p= 0.00] and production [f(4,45)=97.90, p=o.oo], indicating that both skills did 
indeed differ across the three new groups. However, no differences emerged within 
each group when individual perception and production performance was compared 
(p>0.05), suggesting that perception and production appeared again to be aligned. 
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E.mono. (1) 
(N=io) 

Good (2) 
(N=8) 

Interm.+ (3) 
(N= 3 ) 

Interm.- (3) 

(N= 5 ) 

Poor (4) 
(N=i 4 ) 

Perc. 

94.26 

95-83 

93-83 

55-19 

61.11 


( 4 - 57 ) 

( 5 - 84 ) 

(4.28) 

(16.58) 

(11.28) 

Prod. 

96.39 

94.86 

78.34 

75-78 

54-48 


(1.98) 

( 3 - 25 ) 

( 5 - 30 ) 

(5-86) 

( 9 - 13 ) 



K.mono. (5) 
(N=io) 

ANOVA 

F (4.45) Tukey HSD 

Perc. 

50-37 

(13-46) 

23-34 ** 

i, 2 > 3 , 4,5 

Prod. 

50-39 

( 9 - 53 ) 

97.90 ** 

i, 2 > 3 > 4,5 


Table 5. Percentage of vowels perceived and produced correctly by Korean-English 
bilinguals (individual reanalysis). ‘Intermediate +’and ‘Intermediate -’indicate the 
bilinguals with intermediate production skills and respectively high and low perception 
skills. ‘HSD’ indicates the results of a Tukey HSD test (alpha = 0.05; **p < 0.001) com¬ 
paring the performance of the five groups when the group factor reached significance. 

Surprisingly however, the bilinguals whose production accuracy was intermedi¬ 
ate (75-80%) displayed a rather large amount of variability in their perception scores 
given similar production performance. As Figure 1 (overleaf) depicts, in half of these 
bilinguals, perception accuracy was in fact identical to the monolingual perception 
norm and surpassed production accuracy. Assuming a perceptual basis for accurate 
speech production, for these bilinguals production ability may, in fact, eventually 
align with their perception ability and the monolingual production norm. However, 
the other half of the bilinguals in this group attained better accuracy in production 
than perception (e.g., Sheldon & Strange 1982). It may be the case that either pro¬ 
duction precedes perception for these bilinguals, thus lending support to the third 
hypothesis entertained earlier, or perhaps factors other than perception underlie pro¬ 
duction accuracy in these individuals. 

One such factor might be the ability to perceive contrasts of a speaker’s own pro¬ 
ductions. There is empirical evidence in first-language acquisition that in order to learn 
to speak, infants, through babbling and cooing, may attempt to match the acoustics 
of their own productions with the acoustics of others’ speech, thus defining categories 
for native-language sounds (Vihman 1991, Kuhl & Meltzoff 1996). In fact, auditory self¬ 
perception, or the ability to hear one’s own speech, is deemed critical for establishing 
accurate perceptual and articulatory representations in early language development. 
Similar evidence is marshaled in research on language-impaired children, in whom 
inaccurate language production is frequently associated with an inability to under¬ 
stand one’s own speech, given an overall superior ability to understand others (Panagos 
& King 1975). Findings such as this strongly implicate self-perception as a necessary 
and perhaps fundamental element in the native-language production ability. 
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80 

0 

CD 


O 

O 

60 

C 


0 

O 

40 

0 

CL 

20 


0 



Good perception Bad perception 


Figure 1. Percentage of vowels perceived and produced correctly by Korean-English 
bilinguals with intermediate production accuracy. 


Extended to second-language acquisition, a similar claim may offer a feasible 
explanation for the somewhat surprising finding of the present study. It may be that 
auditory self-perception provides the perceptual basis for superior production abili¬ 
ties for those speakers whose production abilities exceed their perception abilities, 
allowing for the needed mapping of auditory targets onto accurate articulatory tar¬ 
gets. More importantly however, from a theoretical perspective, such an explanation 
would accentuate perceptual bases of accurate production, much in line with most 
research in the field (e.g., Flege 1995, Rochet 1995). 

Beyond doubt, however, auditory self-perception in the absence of the ability to 
accurately perceive others may not in itself translate into accurate production. Per¬ 
haps what self-perception, as part of an auditory-articulatory feedback mechanism, 
underlies is the ability to compare ones own attempts at speech with others’ produc¬ 
tions. Essential to both first- and second-language learning, self-perception cast in 
such terms would entail the ability to perceive differences between one’s own speech 
and the auditory targets in others’ speech, thus allowing speakers to modify their 
speech to match these targets. 

Because both production and self-perception are contingent upon hearing audi¬ 
tory targets in others’ speech, we hypothesize that, for all bilinguals, perception will 
be better than, or equal to, production and self-perception. By the same token, bilin¬ 
guals with good production will be equally good at perception and self-perception, 
because they are able to match their own productions with the auditory targets in 
others’ speech. On the other hand, bilinguals with poor production skills will also be 
poor at both perception and self-perception because they are unable to match their 
own productions with the auditory targets in others’ speech. However, bilinguals with 
intermediate production skills will, in large measure, perform better at perception 
and self-perception than production, because, as hypothesized earlier, the ability to 
compare one’s own speech with auditory targets in others’ speech is essential in devel¬ 
oping accurate production skills. 
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100 



m 


□ Production 
■ Perception 

□ Self-perception 


Good Interm.+ Interm.- 


Poor 


Production Accuracy 


Figure 2. Production, perception, and self-perception accuracy for Korean-English 
bilinguals. Dashed lines indicate an average for production and perception perfor¬ 
mance for English monolinguals (96.39 in production and 94.26 in perception) and 
Korean monolinguals (50.39 in production and 50.37 in perception). 

2. SELF-PERCEPTION AS A LINK BETWEEN PERCEPTION AND PRODUCTION. Testing 

these predictions, we asked the same bilinguals to return and perform a word identi¬ 
fication task on their own speech to obtain a measure of self-perception. A bilingual’s 
self-perception score was thus defined as the ability to perceive one’s own speech 
identically to the way English monolinguals did in experiment 2. The stimuli and pro¬ 
cedure in this experiment were identical to the production rating task performed by 
English monolinguals in experiment 1. Self-perception scores were available for four¬ 
teen bilinguals. 

2.1. results and discussion. Chi-square tests comparing perception, self-perception, 
and production data confirmed our predictions (see Figure 2). More specifically, rep¬ 
resenting asymptotic performance, the bilinguals whose productions skills are good 
(N=6) are equally good at perception and self-perception [y 2 =o.8o, £>0.05]. Fur¬ 
thermore, the bilinguals whose production skills are poor (N=4) are equally poor 
at perception and self-perception [y 2 =3.i2, £>0.05], performing at chance level and 
therefore marking the beginning of second-language development. Overall, in both 
of these stages of second-language learning, perception is completely aligned with 
production. 

The role of self-perception in the developing relationship between perception and 
production is however most salient in scores of bilinguals with intermediate produc¬ 
tion skills; i.e., where most of the learning occurs. Of the four bilinguals with inter¬ 
mediate production skills who returned to perform this task, two performed better 
at perception than production (Interm. + group). Indeed, as was predicted, their self¬ 
perception and production scores are identical (p>o.05) and intermediate in accu¬ 
racy. The other two bilinguals in this group performed better at production than 
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perception (Interm.- group). Identical to their production accuracy (p>o.05), self¬ 
perception accuracy of these bilinguals tended to exceed their perception scores 
1 x 2= 3 - 49 > p=o.o6]. Implicit in these results is the claim that self-perception, or the 
ability to distinguish ones own productions from others’ speech, is essential in 
the development of production abilities. Indeed, the bilinguals in the Interm. + group 
belong in the developmental stage in which their self-perception ability is improving, 
thus enabling them, with more experience, to attain native-like accuracy in a second 
language. Conversely, the bilinguals whose production surpassed perception accu¬ 
racy, a somewhat surprising outcome of this study, are good self-perceivers. Being 
able to perceive their own speech accurately thus enables them to produce sounds 
accurately. However, these bilinguals may be less likely to attain native-like accuracy 
in production because, in addition to a good self-perception ability, accurate produc¬ 
tion should perhaps be grounded in accurate perception. 

3. general discussion and conclusions. Of the three hypotheses entertained in 
the introduction, overall this study supports the hypothesis that accurate production 
is contingent on accurate perception in a second language, and that perception and 
production abilities may not necessarily be aligned. Indeed, throughout the learning 
process, perception precedes production. However, we propose that self-perception 
is important and perhaps even essential in the development of accurate production 
skills. In fact, self-perception, as both an auditory and articulatory feedback mecha¬ 
nism, may be a necessary link between speech perception and speech production. 
That is, without self-perception, or the ability to compare one’s own attempts at 
speech with the speech production of others, and regardless of whether such a com¬ 
parison occurs at an auditory (e.g., Flege 1995) or articulatory (e.g.. Best 1995) level, 
second-language learners may not be able to link auditory perceptual targets with 
their own articulation efforts. The influence of self-perception may also explain situ¬ 
ations where on the surface it appears that production precedes perception. In actu¬ 
ality, as self-perception data demonstrate, accurate production is nevertheless still 
perceptually based (Newman 1998). 

An important implication of this conclusion for speech processing in general and 
second-language research in particular is that self-perception is essential in learning 
sounds, enabling speakers to gradually approximate their production ability to their 
perception ability. Subjecting this hypothesis to more rigorous testing in both first- 
and second-language research may further illuminate the relationship between per¬ 
ception and production in speech processing. 
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this paper 1 discusses an extensive cross-linguistic vowel-discrimination 
experiment and relates the results to various hypotheses concerning speech percep¬ 
tion. Section 1 describes the experiment; section 2 presents some of the results; and 
section 3 gathers together certain points mentioned earlier and relates them briefly to 
wider issues. The remainder of the introduction is devoted to a brief account of the 
vowel-discrimination part of the work of Stevens et al. (1969), which will serve as a 
convenient introduction to our own experiment. Further discussion of their article 
will, however, be saved until section 3. 

Stevens et al. (1969) ran two vowel-discrimination tests, each featuring a 13-step 
synthetic vowel continuum, with the aim of determining how well native speakers of 
American English and of Swedish could discriminate pairs of vowels 1 step, 2 steps 
and 3 steps apart. For one test the continuum involved front unrounded vowels and 
covered both the English contrast between HI, HI and Id and the Swedish contrast 
between HI, Id and /a/. The other test involved a continuum taking in the Swedish 
vowels /i y u/; it thus involved the rounded vowels [y tt], which exist in Swedish but 
not in American English. Figure 1 shows the discrimination functions for the two 
groups for the l-step pairs of this test. (The curves for the 2-step and 3-step pairs 
involved higher percentage scores and were also flatter.) Despite the fact that the 
Americans had no phonemes corresponding to [y] and [u], they had discrimination 
peaks in the same place as the Swedes, and the correspondence between the two 
groups in this test was in fact closer than in the test on unrounded vowels. Stevens 
et al. (1969:12) interpreted this result as indicating that the discrimination peaks are 
present universally as part of our hearing mechanism prior to the learning of any 
particular phonemic system: 

If we extrapolate our results to other vowels and to other languages, the conclu¬ 
sion to be drawn is that, for vowels presented in isolation, the listeners’ linguistic 
experience has essentially no effect upon their ability to discriminate small differ¬ 
ences in the vowel formant frequencies. 
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Vowel pair 


Figure i. Discrimination functions for l-step vowel pairs along a continuum taking 
in [iytt], as judged by Swedish and American listeners (adapted from Figure y of 
Stevens et al. 1969:11). 



Figure 2. (a) The primary and secondary cardinal vowels, arranged according to Fi 
and F2; (b) the high-low continua of tests A-H; and (c) the front-back continua of 
tests I to P. 


In the present experiment we set out to investigate the extent to which such extrapo¬ 
lation is well-founded, by covering all areas of the vowel space—high, low, front, back, 
rounded and unrounded—and using native speakers of a wider range of languages as 
subjects, namely English, German, Japanese, Mandarin and Cantonese. 

1. the experiment. The synthetic vowels of the experiment include all of Daniel 
Jones’ (1960:31-39) eight ‘primary cardinal vowels’ [i e e a a 0 o u] and the correspond¬ 
ing eight ‘secondary cardinal vowels’ [y 0 oe ce 0 a y ui] (which have the same tongue 
position as the primaries but a different lip position). From an acoustic point of view, 
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(a) 1 2 3 4 5 6 7 8 9 10 11 12 

1 1 1 

13 14 15 16 17 18 19 

1 

intermediate vowels 

(e.g.) [i] [e] 

intermediate vowels 

[e] 

(b) all 19 stimuli paired with themselves, played twice 

(38) 

all 18 1-step pairs, in AB and BA order 

(36) 

all 17 2-step pairs, in AB and BA order 

(34) 

all 16 3-step pairs, played once each, with alternating 

AB and BA order (1-4, 5-2, 3-6, 7-4...) 

(16) 


Figures. M The stimuli for a given test; and (b) the 124 pairs of vowels of each test. 


on the basis of the frequency of formants 1 and 2, the eight secondary cardinal vowels 
(CVs) are located between the front series [i e e a] and the back series [oiou] of the 
primary CVs, as shown in Figure 2(a). 

The experiment comprises sixteen separate tests, each featuring an acoustic con¬ 
tinuum taking in three adjacent CVs in either the vertical or the horizontal dimension 
of Figure 2(a). The eight high-low continua are shown in Figure 2(b) and the eight 
continua involving front-back (and rounded-unrounded) distinctions are shown in 
Figure 2(c). It would have been simpler to have organized the synthetic stimuli into 
only eight tests, one for each of the rows and columns of Figure 2(a). However, we 
wanted to operate with smaller intervals than Stevens et al. (1969) and synthesized 28 
acoustically distinct vowels along each of these dimensions; and because we wanted 
to present all 3-step, 2-step, l-step and o-step (i.e., identical) pairs for judgments, each 
such test would have taken approximately 30 minutes to run. By using shorter, over¬ 
lapping continua and doubling the number of tests, we were able to reduce the time 
taken by each test to around 15 minutes, thereby reducing the burden of concentra¬ 
tion on the subjects in any one test. 

Figure 3(a) shows how each continuum was divided up. Eight intermediate vowels 
were synthesized between each of the pairs of adjacent CVs; thus the three CVs of 
a given test were stimuli numbers 1,10 and 19. For the CVs themselves the frequen¬ 
cies of the first three formants were derived from analysis of Daniel Jones’ recording 
and for the intermediate vowels the formant frequencies were calculated in such a 
way that the steps were equidistant logarithmically. The 19 3-formant vowels of each 
test were synthesized, with a falling intonation (or tone) on a JSRU parallel formant 
synthesizer at University College London. Each test contains 124 randomized pairs of 
vowels, made up as indicated in Figure 3(b), and the task is to judge each pair as ‘same’ 
or ‘different’. 

The subjects who took part in the experiment were 5 groups of 16 students, aged 
in their late teens or early twenties (with the exception of one English girl who was 
14 years old). The Mandarin speakers were all natives of Beijing and the Cantonese 
speakers natives of Hong Kong. The English speakers were all natives of London and 
the Germans natives of Diisseldorf. Fourteen of the Japanese subjects were natives of 
Osaka and two came from Kyoto. The tests were played through loudspeakers in a lis- 
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tening room at the Institute of Acoustics in Beijing, in language laboratories in Osaka 
and London, in a music room in Diisseldorf and in a classroom in Hong Kong. 

2. THE RESULTS. 

2.1. general. Over all five groups of subjects, the identical pairs (o-step pairs) 
received the highest percentage-correct score, averaging out at 8o%. Next came the 
3-step pairs (73%), then the 2-step pairs (53%), then the l-step pairs (34%). As also 
in the case of Stevens et al. (1969), discrimination peaks are more in evidence in the 
l-step results. It makes sense, therefore, to concentrate on these (see section 2.2). But 
first it is of interest to consider the overall performance of each group on the 16 tests. 

Figure 4 shows the overall percentage-correct score (across stimulus-pairs of all 
four types) for each of the 16 tests for each of the five language groups. Analysis of 
variance (using the statistical package Minitab), carried out earlier on the English, 
Mandarin and Japanese results and reported in Bennett et al. (1992), revealed inter 
alia that there are highly significant differences between these groups. Moreover, this 
is true even after allowance is made for differences in the overall level of performance 
of the groups (which might be due in part to differences in the experimental condi¬ 
tions, such as the rooms in which the tests were played). As regards the ranking of 
the tests for a particular group, the differences in the percentage scores are often so 
small that one would expect the ranking to come out differently if the experiment 
were repeated. Nevertheless, several similarities and differences between the groups 
are significant. For instance, all groups performed well on tests A and B, covering the 
range [i e e a], and much less well on tests O and P [a ce o a]. The most striking dif¬ 
ference between the groups involves the performance of the English and Mandarin 
groups on tests I and J, covering the range [i y ui u]. For the English subjects, J was 
the easiest test and I the second easiest. The Mandarin group, on the other hand, per¬ 
formed worst on test I and not much better on test J. A further point of interest is 
that the English group performed better in general on the front-back tests (I - P) by 
comparison with the high-low tests, whereas the Mandarin group performed better 
on the high-low tests than on the front-back tests. In both cases the difference was 
highly significant statistically. (These points will be taken up again in section 2.4.) 

2.2. discrimination peaks. Figure 5 (overleaf) presents discrimination functions for 
the l-step pairs of test A, involving the continuum [i e e], which was one of the tests 
on which all five groups performed well. It will be seen that all five groups were good 
at discriminating between stimulus-pairs 3-4,8-9 and 15-16; and it would be reason¬ 
able to suggest that there are discrimination peaks at these three positions along the 
continuum for all five groups. However, in order to avoid subjectivity in the matter of 
deciding where there are discriminations peaks, it is desirable to decide on a precise 
definition of what constitutes a peak. The definition that we chose to employ is con¬ 
servative rather than liberal, i.e., certain bars in charts such as those in Figure 5 that 
one might intuitively regard as peaks turn out not to qualify as peaks according to our 
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English 

Mandarin 

Japanese 

Cantonese 

German 

Test 

% 

Test 

% 

Test 

% 

Test 

% 

Test 

% 

J: y ui u 

73.6 

B: e £ a 

57.2 

A: i e e 

59.5 

B: e e a 

60.3 

B: e £ a 

67.9 

I: i y ui 

70.4 

D: 0 oe ce 

56.6 

B: e e a 

58.5 

A: i e e 

60.0 

A: i e e 

64.4 

B: e E a 

68.5 

H: 0 3 a 

55.6 

I: i y ui 

57.9 

D: 0 oe ce 

58.6 

I: i y ui 

64.2 

N: oe a a 

68.5 

C: y 0 oe 

55.5 

J: y ui u 

55.2 

H: 0 3 a 

58.3 

Jryiuu 

64.0 

A: i e E 

68.0 

A: i e e 

55.2 

H: 0 3 a 

55.1 

I: i y ui 

57.6 

G: u 0 3 

63.1 

K: e 0 y 

67.2 

K: e 0 y 

53.1 

C: y 0 oe 

54.9 

G: u 0 3 

55.8 

D: 0 oe ce 

61.9 

D: 0 oe ce 

65.9 

H: 0 3 a 

53.0 

D: 0 ce ce 

54.3 

K: e 0 y 

55.6 

N: oe a 3 

61.9 

M: e oe A 

63.4 

O: a ce o 

53.0 

K: e 0 y 

54.2 

J: y ui u 

55.4 

H: 0 3 a 

61.7 

H: 0 d a 

63.2 

M: E CE A 

52.2 

G: u 0 3 

53.4 

C: y 0 oe 

54.1 

G: u 0 3 

61.4 

L: 0 y 0 

62.2 

N: oe a 3 

52.0 

M: e oe a 

52.4 

F: Y A n 

54.0 

K: e 0 y 

61.0 

O: a <e d 

61.8 

G: u 0 3 

51.3 

N: oe a 3 

51.5 

L: 0 y 0 

53.0 

G: u 0 3 

59.2 

C: y 0 oe 

61.4 

J: y ui u 

50.3 

E: ui y a 

51.2 

M: e oe A 

52.7 

L: 0 y 0 

58.4 

G: u 0 3 

60.8 

F: Y A D 

50.2 

P: ce n a 

51.2 

O: a ce d 

51.7 

C: y 0 oe 

58.2 

F: Y A D 

57.7 

L: 0 y 0 

50.1 

F: Y A n 

50.6 

E: ui y A 

51.6 

M: e oe a 

57.4 

P: ce n a 

56.5 

E: ui y A 

48.4 

L: 0 y 0 

50.3 

N: oe a 3 

51.3 

E: ui y a 

56.1 

E: ui y a 

56.1 

P: ce n a 

47.5 

O: a ce d 

49.9 

P: ce d a 

48.5 

F: Y A D 

54.4 



I: iyui 

46.9 





O: a ce n 

50.6 



I: i y ui 

46.7 





P: ce d a 

48.7 


Figure 4. The 16 tests, A-P, ranked according to the overall percentage-correct score 
for each of the five groups 2 . 


definition 3 . The bars that do constitute peaks according to our definition are shown 
as dark bars. In the English chart there are two such peaks; the percentage correct 
score for the pair 3-4 does not differ sufficiently from its two immediate neighbors 
to count as a peak. In the Mandarin chart there is only one peak, corresponding to 
the pair 15-16. The other two candidates for the status of peaks in this chart are ruled 
out not so much by their own percentage values as by the fact that the overall level of 
performance of this group was relatively high. In the Japanese chart, as in the English 
chart, two of the three candidates for the status of peaks count as peaks according to 
the definition and the third, for pair 3-4, does not. In the Cantonese chart, all three 
candidates count as peaks; and it will be noticed that the overall level of performance 
is lower than that of the Mandarin group. Finally, the German chart has peaks not 
only at the same three places as the Cantonese chart but also two additional peaks, for 
pairs 6-7 and 13-14. What is also notable about the German chart is that, apart from 
the peaks, the overall level of performance is quite low. It contrasts in this respect with 
the Mandarin chart. 

We may contrast the l-step discrimination functions for test A (Figure 5, overleaf) 
with those for test I (Figure 6, overleaf), which covers the range [i y ui]. This was a 
test on which the English group performed well and the Mandarin group performed 
badly. It will be seen that the English and German subjects had one peak, situated at 
stimulus-pair 10-11, whereas the Mandarin, Japanese and Cantonese charts exhibit a 
relatively even performance over the 18 pairs and contain no peaks. 

Figure 7 summarizes the number of peaks that were found altogether, according 
to the definition of peaks given in note 3, and presents the information once for the 
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Figure 5 (left column), l-step discrimination functions for Test A [i e e], and Figure 
6 (right column), l-step discrimination functions for Test I [iy ui]. The x- andy-axes 
identify vowel pairs and percentage scores respectively. 
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A 

B 

C 

D 

E 

F 

G 

H 

I 

J 

K 

L 

M 

N 

O 

P 

13 

15 

8 

7 

5 

6 

10 

10 

4 

5 

10 

9 

8 

8 

6 

4 

English 

Mandarin 

Japanese 

Cantonese 

German 

36 

16 

14 

20 

42 


Figure 7. The number of peaks found (a) in the 16 tests and (b) according to language- 
group. 

individual tests and once for the five language groups. It should be noted that because 
of the overlap in the range of stimuli covered by the eight pairs of tests A-B, C-D... 
O-P, some peaks have in effect been counted twice—e.g., the peak for vowel-pair 
15-16 in German test A coincides with that for vowel-pair 6-7 in German test B. Cer¬ 
tain details of Figure 7 are of some interest, e.g., the fact that the results of the tests 
that cover the primary CVs [i e e a a 0 o u], i.e., tests A, B, G and H, exhibit more peaks 
than are in evidence in the case of the secondary CVs [y 0 oe ce a a y ui], tests C-F. 
(The front-back tests I-P take in both primary and secondary CVs.) Also of obvious 
interest is the discrepancy between the number of peaks in evidence in the English 
and German results and the number of peaks exhibited by the Mandarin, Japanese 
and Cantonese data. However, limitations of space in the present paper prevent us 
from exploring such issues further. 

2.3. discrimination profiles. Given that there is a degree of arbitrariness associ¬ 
ated with any definition of what constitutes a peak, and given the similarity in profile 
between the English, Mandarin, Japanese and Cantonese charts in Figure 5 (despite 
the fact that they differ in the number of peaks present), the possibility suggests itself 
of comparing the results on the basis of the discrimination profile itself. Spearmans 
correlation coefficient depends on the ranking of two sets of values rather than the 
actual values. Bennett et al. (1992) included a table showing Spearmans correlation 
coefficients for l-step discrimination functions for all sixteen tests for the language 
data then available, i.e., the English, Mandarin and Japanese data. The table is repro¬ 
duced here as Figure 8. For each pair of language-groups, the table indicates the 
amount of agreement between their responses to each of the tests: the higher a par¬ 
ticular test is situated in the diagram, the greater the degree of agreement between 
the two discrimination profiles in question. The most interesting feature of this table 
relates to the tests that involved high-low continua, tests A-H. Four of these cover the 
primary CV continua [i e e], [e e a], [u o d] and [o 0 a], whereas the other four cover 
the secondary CV continua [y 0 oe], [0 oe ce], [ui y a] and [yab]. The primary CV 
tests are indicated in Figure 8 with a p and the secondary CV tests with an s on 
the right of the relevant columns. It will be noticed that there is better agreement 
between the responses of the three groups of subjects to the primary CV tests than 
there is in the case of the secondary CV tests. It was obviously of interest to determine 
whether this relationship would hold also for other pairs of language-groups. This 
point will be taken up again in section 3. 
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significance 

level 

English-Japanese 

English-Mandarin 

Mandarin-Japanese 

coefficient 

test 

coefficient 

test 

coefficient 

test 

O.Ol 

0.828 

B [e £ a] p 



0.843 

A [ie e] p 

0.813 

G [u 0 0] p 



0.791 

H [0 3 a] p 

0-754 

0 [a ce t>] 

0.769 

B [e £ a] p 

0.736 

L [0 y 0] 

0.714 

H [0 i a] p 

0-735 

H [0 3 a] p 

0.723 

B [e e a] p 

0.02 

0.616 

M [e oe a] 

0.622 

L [0 y 0] 



0.609 

A [i e e] p 

0.59 

A [i e e] p 



0.05 

0-545 

L [0 y 0] 

0.521 

N [oe a 3] 

0.532 

G [u 0 3] p 



0.518 

K [e 0 y] 

0.48 

i by w] 

0.1 



0.42 

G [u 0 3] p 

0-394 

D [0 oe ce] s 

0-34 

K [e 0 y] 

0-35 

I [y ui u] 

0.238 

N [oe a 3] 

0-34 

P [ce d a] 

0.297 

D [0 oe ce] s 

0.233 

K [e 0 y] 

0.336 

D [0 oe ce] s 

0.265 

E [ui y a] s 

0.216 

0 [a ce d] 

0.3H 

C [y 0 oe] s 

0.167 

M [e oe a] 

0.192 

E [ui y a] s 

0.314 

N [oe a d] 

0.142 

O [a ce d] 

0.14 

C [y 0 oe] s 

0.313 

I [y in u] 

0.13 

F [y a n] s 

0.07 

P [ce d a] 

0.31 

E [ui Y a] s 

0.094 

I [iym] 

0.055 

J [yuiu] 

0.266 

F [y a d] s 

0.076 

C [y 0 oe] s 

0.041 

M [e oe a] 

0.237 

I [iyui] 

-0.224 

P [ce d q] 

-0.11 

F [y a d] s 


Figure 8. Spearman’s correlation coefficients for l-step discrimination functions for all 
16 tests for the English, Mandarin and Japanese subjects. 


significance level 

correlation coefficient 

compared listener-groups 

0.01 

0.637 

English-German 

0.02 



0.05 

0.48 

Mandarin-Japanese 

0.1 




0.356 

Japanese-Cantonese 


0.237 

English-Japanese 


0.195 

Mandarin-Cantonese 


0.094 

English-Mandarin 


0.0003 

German-Mandarin 


-0.03 

German-Japanese 


-0.121 

English-Cantonese 


-0.23 

German- Cantonese 


Figure 9. Spearman’s correlation coefficients for l-step discrimination functions for 
test I [iy iu] for all pairs of language-groups. 

2.4. CORRELATIONS BETWEEN THE RESULTS AND THE VOWEL SYSTEMS. If an experi¬ 
ment such as the present one were to reveal no statistically significant differences 
between the various language groups, it would be reasonable to concur with Stevens 
et al. (1969:12) that ‘listeners’ linguistic experience has essentially no effect upon their 
ability to discriminate small differences in... vowel formant frequencies’. Instead, 
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though, the present experiment has revealed various kinds of statistically significant 
differences between the groups. It is incumbent upon us, therefore, to attempt to 
explain how speakers may have acquired the differences in question, and the most 
obvious candidate for an explanation is that they have acquired them in the course 
of learning the vowel system, or possibly the phonological system more generally, of 
their native language. However, it is by no means straightforward to establish the rel¬ 
evant correlations between the experimental results and appropriate details of the 
vowel systems. In the remainder of this section we hope merely to give some idea of 
the nature of the problem. 

Test I [i y uu] provides a good example of the kinds of differences that our experi¬ 
ment has revealed. First, the l-step discrimination functions for this test (see Figure 6) 
show a single peak in the same position for the German and English subjects and 
no peaks for the Mandarin, Japanese or Cantonese groups. Moreover, this grouping 
of German and English vs. Mandarin, Japanese and Cantonese is indicated also quite 
clearly by Spearmans correlation coefficients for this test, as shown in Figure 9. We 
see, namely, that there is only a 1 in 100 possibility of the degree of correlation 
between the English and German subjects in this test being due to chance, whereas 
the much lower degree of correspondence between either the English or the Germans 
and the other language groups could well be due to chance. Secondly, in terms of the 
overall performance of the Mandarin and the English subjects, this test was the one 
on which the Mandarin subjects performed least well, whereas for the English sub¬ 
jects it was the second easiest test (see Figure 4). 

The discrimination peak for the English and German subjects in test I involves 
stimulus-pair 10-11, which would be located roughly in the middle of [y] rather than, 
say, near the [i]-[y] boundary. Given that German, Mandarin and Cantonese have a 
phonemic opposition between /i/ and /y/, whereas English and Japanese do not, there 
is no obvious reason why precisely English and German should have a discrimination 
peak in this position. 

As for the overall difference in performance on test I by the English and Mandarin 
subjects, one might perhaps initially have predicted the opposite outcome purely on 
the basis of the listeners’ familiarity with the vowel sounds in question: of the sounds 
[i], [y] and [ui], only the first (and a somewhat diphthongal version of it) occurs in 
the relevant dialect of English, whereas all three can be heard in the Beijing dialect of 
Mandarin, as in the morphemes y'l (30+ ways ambiguous in isolation), yu (20+ ways 
ambiguous in isolation) and si (6+ ways ambiguous). However, the vowel [uu] in the 
dialect in question is necessarily preceded by one of the consonantal onsets written in 
pin-yin romanization as ‘s’, c’ and ‘z’; and pin-yin in effect treats [uu] as an allophone 
of the HI phoneme. An alternative prediction, then, would be that phonemic contrasts 
would have more effect on vowel discrimination than the particular range of allo- 
phones that occur. It might seem, namely, a priori, that discrimination peaks could be 
expected to be in evidence at phoneme boundaries but that allophonic differences, 
which can presumably be ignored by listeners, would have no corresponding peaks. 
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This last consideration could be involved in the Mandarin subjects’ superior per¬ 
formance on high-low tests by comparison with front-back tests. Beijing Mandarin 
has only one mid-height vowel phoneme, whose realization varies, depending on its 
context, between [e] in ye (e.g.,‘night’), [a] in wen (‘ask’), [a] in e (e.g.,‘hungry’) and 
[o] in wo (e.g.,‘fertile’). Such widespread allophonic variation could be the reason why 
the Mandarin subjects were less good at hearing front-back distinctions than high- 
low distinctions. 

Finally, the extra peaks in the German chart in Figure 5 could relate to the pres¬ 
ence in German of a four-way phonemic distinction in the [i e e] area of the vowel 
space: cf. bieten, ‘to offer’, bitten ‘to request’, beten ‘to pray’ and Betten ‘beds’. However, 
if diphthongal counterparts are considered, English also has a four-way distinction: 
beat, bit, bait and bet. 

3. conclusion. The remainder of our discussion can be conveniently organized 
around the distinction between what is language-specific and what is universal in 
relation to vowel discrimination, and speech perception more generally. While Ste¬ 
vens et al. (1969:13) acknowledged that the identification of speech sounds as exam¬ 
ples of the phonemic categories of a language reveals language-specific phoneme 
boundaries, they claimed that vowel discrimination depends on the (universal) audi¬ 
tory mechanism and is unaffected by whatever system of vowels listeners have learned 
to produce and identify. More recent writers, e.g., Kluender (1994:188-97) and Rosner 
and Pickering (1994:106-11), redress the balance and acknowledge the importance of 
linguistic experience also for the discrimination of speech sounds, partly on the basis 
of the by now quite extensive research on speech sound discrimination by young 
infants. 

The present experiment has provided evidence of both universal and language- 
specific aspects of vowel discrimination. The similarity between the English, Manda¬ 
rin, Japanese and Cantonese charts in Figure 5 (the l-step discrimination functions 
for test A [i e e]), in terms of both the location of peaks and the overall profile, would 
seem to suggest a universal mechanism. What lends itself more to a language-specific 
interpretation in the case of the German chart in Figure 5 is perhaps not so much the 
presence of the extra peaks that it contains as the sharp contrast between the very 
prominent peaks and the otherwise rather low percentage scores across the whole 
range of vowel pairs. Figure 6 also revealed a striking difference between the English 
and German charts for test I [i y ui] and the charts for the other three language- 
groups, in the form of a prominent discrimination peak for the English and the Ger¬ 
mans in the vicinity of the vowel [y], though we were unable to point to any feature 
of the English and German vowel systems that would explain this shared peak. The 
shared peak on the part of the English and the Germans, by comparison with the 
other three groups, in this one test, is in fact part of a general tendency for the English 
and Germans to exhibit far more peaks than the other groups (see Figure 7). 

It is well known that front vowels with spread lips predominate in languages of 
the world over front vowels with rounded lips; and that back vowels with rounded lips 
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predominate over back vowels with spread lips. Jones’s CV system reflects this fact 
in its choice of the terminology ‘primary’ and ‘secondary’. It has been suggested that 
the favored association of tongue and lip position is ‘consistent with the view that 
talkers articulate in order to provide acoustic products that are maximally distinctive’ 
(Kluender 1994:182). Several details of our experiment are consistent with this uni¬ 
versal tendency. Each language-group performed somewhat better in tests involving 
primary CV continua than secondary CV continua (see Figure 4). Secondly, the tests 
involving primary CVs yielded more discrimination peaks than the tests involving 
secondary CVs (see Figure 7). Thirdly, Spearman’s correlation coefficients revealed 
greater agreement between pairs of language-groups for the primary CVs than for the 
secondary CVs. When this test was applied to all further pairings of the five groups, 
the result was not quite so dramatic as that of Figure 8, where there is no overlapping 
whatsoever of the primary tests and the secondary tests. Nevertheless, a very strong 
tendency remains for there to be closer agreement between the groups in the primary 
tests than the secondary tests. 

We feel that we have only just begun to get to grips with the wealth of data avail¬ 
able to us. In future work a major challenge will be to attempt to tease apart those 
aspects of the data that reflect universal tendencies and those that have a language- 
specific interpretation. 


We are grateful for the financial support provided at various stages of this project by the 
following organizations: the Royal Society, the British Academy, the Sino-British Fellow¬ 
ship Trust, the Central Research Fund of the University of London, and the School of Ori¬ 
ental and African Studies. For help in running the experiment in the various locations we 
are grateful to pupils of Ftaidian High School, Beijing, and graduate students of the Insti¬ 
tute of Acoustics; to Namsun Song, Yoshima Kajiya and students of the Osaka University 
of Economics and Law; to Ingrid Liillau and pupils of the Suitbertus-Gymnasium in Diis- 
seldorf; to Cheung Kwan-hin and students of Hong Kong Polytechnic; and to Adrian Ben¬ 
nett, Gregory Bennett and 14 of their friends. We also gratefully acknowledge the help in 
London of Dick Hayward, Katrina Hayward, Michael Mann and the late Rodney Mellor; 
and the invaluable computational assistance of Adrian Bennett. 

It will be noticed that there are 18, rather than 16, entries for the Mandarin and German 
groups. There is a different explanation in the two cases. With the Mandarin group it was 
decided to run tests H and I both at the beginning and at the end of the sequence of tests 
so that the two sets of results could be compared for consistency. The second run of the 
two tests is shown in Figure 4 with an italicized H or I. Since such repetitions of tests 
added to the length of what was in any case a very long experiment, it was decided not to 
repeat any tests with the other four groups. On the first trip to Germany, however, test G 
was run twice inadvertently and test H was not run at all. (These two tests are recorded on 
opposite sides of the same cassette.) The mistake emerged when the ‘H’ data were being 
analysed: the results were quite out of line with the results of all other tests and other 
groups, e.g., the subjects seemed to have performed better on the l-step pairs than on 
either the 2-step or 3-step pairs. When analysed as a second run of test G (for which the 
random order of the 124 vowel-pairs is different from that of H), the results no longer 
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appeared anomalous. This second occurrence of test G is entered into Figure 4 with an 
italicised G. Since at this stage no German data were available at all for test H, a second 
trip was made to the same school in Diisseldorf to run the two tests G and H with a dif¬ 
ferent, but comparable, group of German subjects. The relevant entries in Figure 4 are 
distinguished by bold-face type. 

The definition may be formulated as follows. Given any of the tests A, B... P, a score is 
recorded at each of the positions 1,2... 18. Each such score represents the percentage of 
correct identifications of the vowel difference out of 32 individual trials. The standard 
error of the difference between any two such scores, s and s 2 , is calculated as 
V((s 1 + s 2 )(200 -s 1 -s 2 ))/8 

and the test-statistic for testing the significance of the difference s - s 2 is 
z = 8 (Sj - s 2 )/V(( Sj + s 2 )(200 - Sj - s 2 )) 

A score at a particular position is identified as a peak if it is significantly greater (at the 
5% level) than the score at each of the neighboring positions (or significantly greater than 
the score of its only neighbor if the position is 1 or 18, at the end of the range). A pair 
of neighboring scores is identified as a combined peak if they do not differ significantly 
from each other but are each significantly greater than their other neighbors. A triple of 
neighboring scores is identified as a combined peak if they do not differ significantly from 
each other but the two outer ones are significantly greater than their other neighbors. Such 
a procedure provides a systematic way of identifying peaks along the range of positions. 
Flowever, it should only be regarded as a reasonable guideline. It does not guarantee to 
find all of what might be regarded intuitively as peaks, nor is it totally free from the risk of 
picking up spurious peaks caused by random sampling error. 
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PERCEPTION AND PRODUCTION IN A SERIES OF RELATED MERGERS 


David Bowie 
Brigham Young University 


In 1972, Labov, Yaeger, and Steiner noted the existence of what have been called 
‘apparent mergers’ or ‘near-mergers’—cases in which vowel classes are produced dif¬ 
ferently by members of a speech community but at the same time are perceived by 
them as being the same 1 . Since then several studies have confirmed that near-mergers 
do exist (for example Janson & Schulman 1983; Di Paolo & Faber 1990; Labov et al. 
1991; Kontra 1993; Faber & Di Paolo 1995; Bowie 2000). In addition, evidence has 
been found that similar effects can be found in second language acquisition (Shel¬ 
don & Strange 1982) and that the concept can be used to explain certain puzzling phe¬ 
nomena in historical linguistics (Nunberg 1980). It should be noted that the evidence 
listed here is evidence for a rather counterintuitive proposal, as it posits that speakers 
are able to produce contrasts they are unable to hear. 

This paper presents a report of results from fieldwork in Waldorf, a medium-sized 
town in Southern Maryland, in order to confirm whether mergers in perception actu¬ 
ally do precede mergers in production in the linguistic situation there. Waldorf is a 
community of about 50,000 people located twenty-three miles to the south-southeast 
of Washington dc; it has historically been a rural community but is in the process of 
changing into a suburban one. Previous dialectological studies have placed Waldorf 
either in the Southern dialect region or at the border of the Southern and Midland 
regions. This town was chosen as the site for the current study in part because previ¬ 
ous fieldwork there had shown that there are mergers in progress among some of the 
pre-lateral non-low back vowels (specifically, the vowels in the words pole, pull, and 
pool). The vowels in these words have been found to be merged only in the pre-lateral 
context in Waldorf; this is unsurprising, as following III is a context that generally 
promotes merger. This situation presents an opportunity to test the hypothesis that 
mergers in perception precede mergers in production not just in situations where the 
merger is universal, but also in cases in which it is a conditioned merger. 

1. methodology. Fieldwork among Waldorfians was conducted between the spring 
of 1998 and the fall of 1999. In this study, twenty-nine native Waldorfians underwent 
commutation tests to determine the progress of the mergers in the pre-lateral non- 
low back vowels in both perception and production. The subjects took a series of 
three commutation tests, each comparing a minimal pair of the vowels under inves¬ 
tigation to determine the presence or absence of mergers in perception. To accom¬ 
plish this, each subject was recorded reading three randomly ordered lists of thirty 
words each, each made up of minimal pairs of the words under study (pole/pull, 
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pseudonym 

sex 

year of birth 

pull-pole 

pull-pool 

pool-pole 

Theona 

f 

1919 

merged 

merged 

distinct 

Elden 

m 

1921 

merged 

merged 

distinct 

Rowan 

f 

1929 

distinct 

distinct 

distinct 

Gerald 

m 

1941 

distinct 

merged 

distinct 

Randall 

m 

1942 

distinct 

merged 

merged 

Elise 

f 

1946 

merged 

merged 

merged 

Raymond 

m 

1948 

merged 

merged 

distinct 

Jeri 

f 

1951 

merged 

distinct 

distinct 

Pippin 

m 

1951 

merged 

distinct 

distinct 

Melina 

f 

1954 

merged 

merged 

merged 

Bo 

m 

1956 

merged 

distinct 

distinct 

Paulie 

f 

1957 

merged 

distinct 

distinct 

Tex 

m 

i960 

merged 

merged 

distinct 

Rosa 

f 

1963 

distinct 

merged 

distinct 

Roy 

m 

1965 

merged 

distinct 

distinct 

Niels 

m 

1966 

merged 

merged 

distinct 

Torren 

m 

1967 

merged 

merged 

distinct 

Blake 

f 

1969 

merged 

merged 

distinct 

Charles 

m 

1969 

merged 

merged 

merged 

Capri 

f 

1971 

merged 

merged 

distinct 

Dayne 

m 

1973 

merged 

merged 

distinct 

Joanne 

f 

1977 

merged 

merged 

distinct 

Helen 

f 

1978 

merged 

merged 

distinct 

Deanna 

f 

1979 

merged 

merged 

merged 

Dawson 

m 

1980 

merged 

merged 

merged 

Gus 

m 

1982 

distinct 

merged 

distinct 

Kelly 

f 

1986 

merged 

merged 

merged 

Cherokee 

f 

1988 

merged 

merged 

distinct 

Thane 

m 

1988 

merged 

merged 

merged 


Table 1. Status of mergers in perception by subject. 

pull/pool, and pole/pool). After this, demographic information was obtained from the 
subject, and each subject then listened to the recording of their own voice reading 
the minimal pair lists. The subjects were then asked to mark down which word they 
believed they heard themselves saying while listening to the tape; a pre-printed form 
with checkboxes was provided for this purpose. 

After the commutation tests were completed, the answers given by the subjects as 
to what they believed they heard were checked against a master list. If the subject 
correctly identified all of the words in a particular minimal pair list, then s/he was 
identified as not having a merger in perception for that pair. On the other hand, if 
the subject misidentified any of the words on a particular list, s/he was identified as 
having a merger in perception in pre-lateral environments for the vowel pair in ques¬ 
tion 2 . In addition, the words rated by the subjects in the commutation tests were put 
through a linear predictive coding analysis to provide first and second formant values 
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for each utterance to determine the existence of mergers in production using for¬ 
mant values. In this way, the same utterances were tested for evidence of merger in 
perception and production, and the danger of differences due to style or setting was 
avoided. 

2. results for mergers in perception. Looking at the results of the testing for 
mergers in perception using apparent time reveals that the mergers have proceeded in 
a particular order in this speech community—first pull and pool merged, then shortly 
thereafter pull and pole merged, and finally the merger of pole and pool took place. 
This is shown in Table 1, which gives the results for merger in perception for each of 
the pairs tested, arranged by the year of birth of the subjects from oldest to young¬ 
est. Note that in this table (as in other tables in this paper that use a merged/distinct 
classification) merged means that the items at the top of the column have collapsed 
together to such an extent that they are rated as merged, while distinct means that 
they are not merged. 

Table 1 shows that, while there was some individual variation before (and in some 
cases after) each merger took hold completely, the most significant difference in the 
perception of pull and pole exists between those born in 1963 or earlier and those born 
in 1965 or later 3 . For pull and pool, the most significant split occurs between those 
born in 1957 or earlier and those born in i960 or later 4 , and the merger of pool and 
pole shows a significant split between those born in 1978 and earlier and those born 
in 1979 and later 5 . 

3. results for mergers in production. Rating the production of mergers is not 
quite as clear-cut, as production as measured by formant values is an analog rather 
than a digital system—that is, it’s the result of the relevant placement of several pro¬ 
duced phonemes rather than a single merged/distinct rating for the entire group. 
For the purposes of the current paper, however, a system was set up to make rating 
the subjects’ production as binary as possible while still mathematically rigorous. 
The first and second formant values of each set of produced vowels were subjected to 
a Student’s t test with a null hypothesis that the vowels were merged in production. 
If the results of this statistical measure gave a value of p< 0.05, a standard cutoff value 
for this sort of testing, for either formant, the vowels were considered to be distinct 
in production. A summary of the results for merger in production for all of the indi¬ 
viduals listed in Table 1 is given in Table 2 (overleaf). 

The most obvious thing to be seen in Table 2 is that there is much less merger in 
production than there is merger in perception 6 . Also, the splits separating the speech 
community’s adoption of these mergers in production do not precisely match the 
speech community’s adoption of the mergers in production. The most significant dif¬ 
ference in production of pull and pole appears between those born in or before 1967 
and those born later 7 (as compared to the perceptual merger, which was adopted 
slightly earlier, beginning with those born in 1965). The difference in merging pull 
and pool in production occurs between those born in 1967 and earlier and those born 
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pseudonym 

sex 

year of birth 

pull-pole 

pull-pool 

pool-pole 

Theona 

f 

1919 

distinct 

merged 

distinct 

Elden 

m 

1921 

distinct 

distinct 

distinct 

Rowan 

f 

1929 

distinct 

distinct 

distinct 

Gerald 

m 

1941 

distinct 

distinct 

distinct 

Randall 

m 

1942 

distinct 

distinct 

distinct 

Elise 

f 

1946 

merged 

distinct 

distinct 

Raymond 

m 

1948 

distinct 

distinct 

distinct 

Jeri 

f 

1951 

merged 

distinct 

distinct 

Pippin 

m 

1951 

distinct 

distinct 

distinct 

Melina 

f 

1954 

distinct 

distinct 

distinct 

Bo 

m 

1956 

distinct 

distinct 

distinct 

Paulie 

f 

1957 

distinct 

distinct 

distinct 

Tex 

m 

i960 

distinct 

distinct 

distinct 

Rosa 

f 

1963 

distinct 

distinct 

distinct 

Roy 

m 

1965 

distinct 

distinct 

distinct 

Niels 

m 

1966 

distinct 

distinct 

distinct 

Torren 

m 

1967 

merged 

distinct 

distinct 

Blake 

f 

1969 

merged 

distinct 

distinct 

Charles 

m 

1969 

merged 

merged 

distinct 

Capri 

f 

1971 

distinct 

distinct 

distinct 

Dayne 

m 

1973 

distinct 

distinct 

distinct 

Joanne 

f 

1977 

distinct 

distinct 

distinct 

Helen 

f 

1978 

merged 

merged 

merged 

Deanna 

f 

1979 

merged 

distinct 

distinct 

Dawson 

m 

1980 

distinct 

distinct 

distinct 

Gus 

m 

1982 

distinct 

distinct 

distinct 

Kelly 

f 

1986 

distinct 

distinct 

distinct 

Cherokee 

f 

1988 

merged 

merged 

distinct 

Thane 

m 

1988 

merged 

distinct 

distinct 


Table 2. Status of mergers in production by subject. 

in 1969 and later 8 (later than the parallel merger in perception, which was adopted 
beginning with those born in i960). The adoption of the merger in production of pole 
and pool, it should be noted, does not lend itself well to statistical testing (and there¬ 
fore to comparison with the adoption of the merger in perception) because the only 
person who exhibits the merger is Helen (born 1978). 

4. COMPARING THE OCCURRENCE OF MERGERS IN PERCEPTION AND PRODUCTION. A 

comparison of Table 1 and Table 2 reveals that there is often a mismatch between 
production and perception among those tested. A table showing where the status of 
each merger matched or differed is given in Table 3, where a D represents a distinc¬ 
tion in both production and perception, an M indicates a merger in both perception 
and production, production indicates a merger in production but not perception, and 
perception indicates a merger in perception but not production. 
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pseudonym 

sex 

year of birth 

pull-pole 

pull-pool 

pool-pole 

Theona 

f 

1919 

perception 

M 

D 

Elden 

m 

1921 

perception 

perception 

D 

Rowan 

f 

1929 

D 

D 

D 

Gerald 

m 

1941 

D 

perception 

D 

Randall 

m 

1942 

perception 

perception 

D 

Elise 

f 

1946 

M 

perception 

perception 

Raymond 

m 

1948 

perception 

perception 

D 

Jeri 

f 

1951 

M 

D 

D 

Pippin 

m 

1951 

perception 

D 

D 

Melina 

f 

1954 

perception 

perception 

perception 

Bo 

m 

1956 

perception 

D 

D 

Paulie 

f 

1957 

perception 

D 

D 

Tex 

m 

i960 

perception 

perception 

D 

Rosa 

f 

1963 

D 

perception 

D 

Roy 

m 

1965 

perception 

D 

D 

Niels 

m 

1966 

perception 

perception 

D 

Torren 

m 

1967 

M 

perception 

D 

Blake 

f 

1969 

M 

perception 

D 

Charles 

m 

1969 

M 

M 

perception 

Capri 

f 

1971 

perception 

perception 

D 

Dayne 

m 

1973 

perception 

perception 

D 

Joanne 

m 

1977 

perception 

perception 

D 

Helen 

f 

1978 

M 

M 

production 

Deanna 

f 

1979 

M 

perception 

perception 

Dawson 

m 

1980 

perception 

perception 

perception 

Gus 

m 

1982 

D 

perception 

D 

Kelly 

f 

1986 

perception 

perception 

perception 

Cherokee 

f 

1988 

M 

M 

D 

Thane 

m 

1988 

M 

perception 

perception 


Table 3. Mergers in perception and production by subject. 

One thing that can be seen in Table 3 is that, once a merger progresses through 
both perception and production (as shown with an M), it is almost never completely 
reversed in both perception and production (which would be shown with a D). This is 
to be expected—these are processes of merger, and reversal of merger is an extraor¬ 
dinary occurrence. 

Another thing that comes out of Table 3 is that, although the perception and pro¬ 
duction of the various mergers and distinctions does match in a bare majority of 
cases, these are nearly matched by the large number of cases in which there is a mis¬ 
match. Most notably, however, of the forty-three cases of a mismatch between per¬ 
ception and production, only one of them shows a merger in production and not 
perception (Helens pool and pole). This in itself is quite suggestive evidence that 
merger in perception generally precedes merger in production in this speech com¬ 
munity. However, before the data in Table 3 can be taken as absolute evidence for this 
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speech community that near-mergers exist, and particularly that mergers in produc¬ 
tion consistently follow corresponding mergers in perception in time, a problem in 
the data has to be dealt with. 

5. THE EXCEPTION—MERGER IN PRODUCTION PRECEDING MERGER IN PERCEPTION. 

There are four logical possibilities for the sorts of matches and mismatches in the 
perception and production of mergers that one could possibly find in a single indi¬ 
vidual: a merger in both perception and production, a distinction in both perception 
and production, a merger in perception coupled with a distinction in production, and 
a merger in production coupled with a distinction in perception. Individuals fitting 
all four of these possibilities were found in, for example, Kontras 1993 study of 
near-mergers, and all of these possibilities were also found in this study. This is puz¬ 
zling, however, because unlike Kontras study, the way the current study was set up 
would be expected to eliminate the possibility of a result showing that merger in pro¬ 
duction precedes that same merger in perception, as the subjects were tested using 
their own production. Obviously, if a subjects production was merged, it would be 
impossible to tell them apart—there would necessarily be a merger in perception in 
that case. Helen, however, was able to tell her apparently merged-in-production pull 
and pool apart. 

The answer to this puzzle, obviously, is that since in this study the presence or 
absence of mergers in production is determined solely from first and second for¬ 
mant values, Helen must be differentiating those vowels by means other than first 
and second formant values. A closer examination shows that this is, in fact, the case— 
Helen uses an exaggerated rounding distinction, in that she consistently exhibits an 
extreme rounding of pole (which presumably depresses the third formant) which 
allows her to tell the difference between it and her less-rounded pool. 

This brings up the question of whether there are other such cases, in which indi¬ 
viduals use methods to make distinctions that are not reflected by formant values. 
Upon listening to the production of all of the Waldorf natives, one finds only two 
other cases in which methods of distinguishing a pair of these vowel classes that do 
not involve formant differences are used. One of these is Paulies production of pull on 
the one hand (which she produces with a very short vowel) and pole and pool on the 
other (which she produces with a longer vowel). The other case is Cherokee’s produc¬ 
tion of pool (which she produces with a broken vowel 9 ) versus pull and pole (which 
she produces with unbroken vowels). It should be stressed that these are not cases of 
hypercorrection (with the variation that is found in such cases), but rather that these 
are consistent parts of these individuals’ linguistic systems. 

Two items need to be noted regarding this new data. The first is that Paulies and 
Cherokee’s unexpected production appears to serve no purpose—the values of the 
first and second formants are quite enough to tell them apart 10 . The second is that it 
seems rather odd that these three individuals have adopted three such different meth¬ 
ods of differentiating these vowels aside from the manipulation of formant values. 
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These two puzzles can be dealt with by pointing to the same fact—all three of these 
people (Helen, Paulie, and Cherokee) produce at least some of the vowels they dif¬ 
ferentiate oddly very close together. It is only in Helens case that the statistical test 
of production results in a claim that a pair she differentiates in an unexpected way 
is actually produced the same according to formant values, but Paulies production 
of pull and pool and Cherokee’s production of pole and pool both overlap physically 
(though not significantly). 

This points to a possibility that will have to remain a conjecture for the moment, 
but that solves this puzzle quite neatly, which is based on the fact that all three of these 
cases occur on the cusp of a merger in production (Bowie 1998). It may be that, 
when the spaces occupied by two vowels start to collapse so that the vowels may 
merge in production, the individual learner’s linguistic system has the option of 
focusing on secondary differences. This may occur even to the point of changing the 
features of one or both vowels so that they can remain separate in production while 
their formant values merge. This conjecture may seem strange when all of the speak¬ 
ers involved are participants in the merger in perception of at least some of the con¬ 
trasts involved in this, but Nunberg (1980) and Di Paolo (1992) have found that a 
contrast in production is enough to allow the individual to learn to differentiate a pair 
of vowel classes. This is the case even for a case of contrast in production without 
an accompanying a contrast in perception. The fact that all of the methods used in 
such a way are unique points to the possibility that these were all abortive attempts to 
change the system, and therefore they are the sorts of things that would only be found 
sparsely scattered about in the speech community. 

6 . CONCLUSIONS AND DIRECTIONS FOR FUTURE RESEARCH. The lUOSt obvious thing tO 

report from this study is that yet another case has been found in which mergers in 
perception precede mergers in production temporally, thus providing still more evi¬ 
dence that this is a general process by which mergers spread. The data that has been 
presented, however, shows some items of interest that go beyond that. 

I have previously reported on mergers in perception preceding mergers in produc¬ 
tion (Bowie 1998,1999). In conducting this research I have found the need for great 
methodological caution. It is tempting, when comparing mergers in perception and 
production, to simply compare results for perception with a simple test for merger 
in production (such as, for example, F1-F2 plots or native speaker judgments). The 
discussion of the case in this study in which merger in production appeared at first 
glance to lead merger in perception, as well as the other two cases in which produc¬ 
tion involved issues aside from first and second formant values, point out the prob¬ 
lems inherent in such an approach. That said, though, there is a danger in going to 
the opposite extreme. As noted earlier, Helen exhibits an extreme rounding of pole. 
Helen was able to make use of the rounded/unrounded contrast for pole and pool to 
differentiate them in perception, but she couldn’t do this for pole and pull. The ques¬ 
tion arises—should studies such as this one consider pull and pool as being produced 
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differently? Similarly, if they should, at what degree of difference should the cutoff line 
be? Future work should deal directly with these issues. 

Finally, this paper is the latest report to confirm that mergers in perception pre¬ 
cede mergers in production in various environments; one could certainly argue that 
it is a conclusion that is now established. However, this is only one part of the larger 
issue of matches and mismatches between speech perception and speech produc¬ 
tion. A vital issue in the perception/production issue is whether this mismatch occurs 
in situations other than mergers. One has to wonder whether there is a similar 
perception/production mismatch when phonemic classes split. If the answer is no, 
then work will have to be done to explain why exactly there is such a difference—but 
if the answer is yes, linguistics will be a step closer to developing a theory that can 
account for all types of phonetic change. In addition, finding out whether there is 
a perception/production mismatch in such spheres as syntactic and lexical change 
is equally important. Such research would have the effect of helping to determine 
whether all transmission of linguistic change functions in the same way, or whether 
the human mind really does deal with different classes of linguistic processes in very 
different ways. 


1 This is a revised and expanded version of a paper presented at the 20th Deseret Language 
and Linguistics Symposium in Provo, Utah on 6 April 2000. Many thanks to Hikyoung 
Lee and two anonymous reviewers for comments on this paper. 

2 This is a very strict standard—even just one misidentified word resulted in the subject 
being labeled as having a merger in production. The rationale, however, is that if there is a 
robust distinction between two phonemes (as an example, consider the initial consonants 
in mat and cat), there would be absolutely no confusion between them. 

3 78.57% of the older group merges the sounds versus 93.33% of the younger group, verified 
as significant by a chi-square test to a level of p<o.oooi. 

4 With 58.33% of the older group merging the sounds and 94.12% of the younger group, veri¬ 
fied by a chi-square test to a level of p<o.ooi. 

5 13.04% of the older group merges the sounds and 66.67% of the younger group, which chi- 
square testing shows to be significant to a level of p<o.ooi. 

6 This is not just an artifact of the method used to determine merger in production—an 
analysis that assumes that any overlap of vowel spaces constitutes merger in production 
also results in the finding that merger in production is more widespread than merger in 
perception (Bowie 1998). 

7 12.50% merged by the older group and 53.85% in the younger group, with a chi-square test 
showing this split significant to a level of p<o.oi. 

8 The younger group merges these 5.88% of the time, the older group 25.00% of the time; 
chi-square testing declares this significant to a level of p<o.oooi. 

9 By a ‘broken vowel’ I mean a vowel broken in two by a reduction in amplitude, so that 
pool seems to be produced somewhat more like [pu.ul] than the unbroken [pul], though 
without the actual full syllable break shown. 

10 It should be noted that these individuals’ ability to tell vowel classes apart using data 
other than formant values appears limited even in these cases. For example, Cherokee 




PERCEPTION AND PRODUCTION IN A SERIES OF RELATED MERGERS 


305 


consistently produces pool with a broken vowel and pole with an unbroken vowel, but she 
is unable to access this to distinguish those two words in perception. 
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PHONEME AND GRAPHEME: HOW PARALLEL CAN THEY BE? 


David G. Lockwood 
Michigan State University 


introduction. In one of the classic papers on the linguistic treatment of writing, the 
British linguist C. E. Bazell (1956) noted that published literature contained two alter¬ 
native conceptions of the grapheme. In one of these ‘the graphic shape of an allograph 
is dependent on its graphic surroundings’ (Pulgram 1951, as quoted by Bazell [359]). In 
the other view, found in a study of Old English writing by Stockwell and Barritt (1951) 
‘two graphs are allographs of the same grapheme if they represent the same phoneme, 
whether or not they stand in relations of complementary distribution’ (Bazell 359). So 
in the first conception <ph> would be the same sequence of two graphemes in graph 
as in uphill, and in both cases utterly distinct from the <f> in fine. In the Stockwell- 
Barritt view, on the other hand, <ph> in the first example would be a coallograph 
with <f> (along with the <gh> of rough and laugh), and this would be utterly distinct 
from the other usage, where <p> and <h> would be separate graphemes by virtue of 
representing different phonemes. 

More recently, the German scholar Manfred Kohrt (1986), has given names to 
these alternative conceptions. The first is termed the analogical view, because it 
makes the grapheme-allograph relation parallel to that between phoneme and allo- 
phone. The alternative is called the referential view, because it is based on the 
phonological reference of the various graphemes. 1 

1. evaluation of views. In evaluating these views, it must first be said that students 
of writing need to include both kinds of relations in any model they adopt. Any rea¬ 
sonable overall model, in other words, must have a place to treat both the variant 
forms of a ‘letter’, character’ or other graphemic unit and a way to discuss and sys¬ 
tematize as much as possible the referential relations that exist between written and 
spoken language. Except in the special case of a perfect one-to-one relation between 
units of writing and spoken language, however, these cannot be treated all at once. 
They will involve separate sections or components of the overall graphonomy. 

That said, the analogical view is the one that makes the most sense, as long as it is 
incorporated into a model that also allows one to relate writing to spoken language in 
a systematic way. Two major reasons for preferring this view are as follows: 

• independence. Certainly no respectable linguist would propose that we make 

our analyses of sound systems dependent on writing. In the same way, it seems 
undesirable to make our analyses of writing dependent on phonology. 
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• comprehensiveness. If a grapheme is defined as a set of ways of represent¬ 
ing a phoneme, it would seem impossible to include under the grapheme 
writing systems not based on sound, or even those based on syllables rather 
than segments. 2 

2. remaining problems. Having decided which fundamental path to follow, how¬ 
ever, one must still consider two serious problems, and both involve certain ways in 
which graphonomy is less than ideally parallel to phonology. 

2.1. first problem: recognizing coallographs. One problem concerns how we 
recognize which physically different written shapes are to be assigned to the same 
grapheme. The referential model suggests an automatic solution to this problem, even 
though it may have fairly bizarre consequences. In following the analogical model, 
it is natural to look to parallelisms with phonology. Since the classic presentation of 
phonemic methodology by Morris Swadesh in 1934, the assignment of sounds as allo- 
phones of the same phoneme has been based on three key principles: 

1. Phonetic similarity 

2. Non-contrastive distribution 3 

3. Pattern congruity 

It should be clear that non-contrastive distribution and pattern congruity can read¬ 
ily be applied in graphemic analysis: coallographs must indeed occur in such a way 
that they do not contrast in a given writing system, and alternative conceptions can be 
evaluated for their relative contribution to the regularity of patterning, causing one to 
prefer a solution which contributes more to such regularity over one that contributes 
less. But what about the analog of phonetic similarity, namely graphic similarity? Is 
this a reasonable requirement for marks treated as coallographs? 

Consideration of familiar facts about alphabetic writing suggests that no require¬ 
ment of graphic similarity would be reasonable. Consider capital and lower-case 
characters in the Roman and similar writing systems. Now it is true that capital and 
lower case letters are not simply coallographs, because in modern writing and print¬ 
ing they are functionally contrastive. Corresponding capital and lower-case forms 
must be treated as graphemically distinct, but that does not mean that they are simply 
different graphemes. Rather, following the practice of Gleason (1961) and Herrick 
(1966), we need to view capitalization as involving a suprasegmental grapheme pres¬ 
ent in addition to the letter grapheme. So any differences of shape under capitaliza¬ 
tion can be seen as analogous to allophones of vowel or consonant phonemes under 
different conditions of accent or tone. Thus, allography would clearly be involved. 
In the modern uses of the Roman and Greek alphabets (termed Neo-Roman and 
Neo-Hellenic by Herrick 1974) capitals differ from corresponding lower-case letters 
in relative size, but often differ in shape as well. Tet us consider the situation in the 
three major modern scripts that use capitalization, the Neo-Hellenic (as for Modern 
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Table 1. Representations of some Inuktitut CV syllables in the Evans script. (Based on 
Nichols 1996) 

Greek), the Neo-Roman (as for English), and the Neo-Cyrillic (as for Russian) 4 . In 
each of these, there are letters that exhibit no strong resemblances of shape between 
the two types, alongside others that do. 5 For the Greek alphabet, half of the 24 letters 
show distinct shapes: A/a, B/( 3 , 17 y, A/6, Z/(, H/q, M/p N/v, Elf Z/a, Y/u, fl/co. The 
others show identical or closely resemblant shapes: E/e, 0 / 9 , III, K/k, A/A., O/o, IT/tt, 
P/p, T/t, ®/<p, X/y, T/ry In the Roman, the proportion is smaller but still consider¬ 
able, with the following ten letters having distinct shapes: A/a, B/b, D/d, E/e, G/g, H/h, 
L/l, N/n, Q/q, R/r. The other sixteen do not have distinct shapes—C/c, F/f, I/i, J/j, K/k, 
M/m, O/o, P/p, S/s, T/t, U/u, V/v, W/w, X/x, Y/y, Z/z. In the Cyrillic, we find only three 
of the 32 letters with distinct capital shapes—A/a, E/6, E/e—while the others have 
almost the same shapes, with capitals differing only in size and a few minor details: 
B/b, r/r, fl/n, >K/>k, 3/3, M/n, M/ii, K/k, JI/ji, M/m, H/h, O/o, n/n, P/p, C/c, T/t, Y/y, 
®/ 4 >, X/x, If/p, H/h, Ill/m, III,/m, "E/i), EI/m, E/b, 3 h, KD/10, R/n. 6 

These examples illustrate that coallographs do not have to show graphic similar¬ 
ity, though obviously they may. In this sense, coallography is more closely parallel to 
coallomorphy than it is to coallophony: allomorphs of the same morpheme are often 
phonemically similar, but in line with the usual view, they do not have to be similar 
at all. Without a counterpart of phonetic similarity, however, there may be problems 
in deciding how to group graphic shapes as allographs: we can use similarity when it 
occurs, but we may not always have it available. 

As a further illustration, let us consider some uses of the Evans script for some 
languages spoken in northern Canada. This script was originally devised in 1840 by a 
missionary named James Evans for the Ojibwe of Ontario and a form of Cree spoken 
in the vicinity of Norway House, in what is now Manitoba. According to the account 
in Nichols 1996, forms of this writing have continued to be used for various dialects 
of Cree and Ojibwe, and the same basic system has been modified for use with some 
Inuit 7 languages and some sub-Arctic Athabaskan languages as well. 

Nichols’s account refers to this system as a syllabary, and apparently Evans him¬ 
self conceived of it in such terms. Its structural organization, however, differs from 
that of an undoubted syllabary such as Cherokee or Linear B because the shape of a 
character indicates the initial consonant, while vowel differences are indicated by the 
orientation assumed by that shape. See Table 1 for some examples, based on the Inuk¬ 
titut orthography established by the Inuit Cultural Institute. This shows three vowel 
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qualities, which are all that are needed for Inuktitut. For Algonquian languages, four 
orientations are generally needed. 

In the same book that contains this article, Peter Daniels (1996) expanded the tra¬ 
ditional typology of writing systems to include some additional types beyond the 
traditional logography, syllabary, and alphabet. In particular he includes a category 
called an abugida, which was intended to cover systems like Ethiopic (from which 
the name originates) and various Indian writing systems such as Devanagari. While 
neither Daniels nor Nichols makes this suggestion, it seems reasonable to conclude 
that the Evans system exemplifies a variety of the abugida rather than a syllabary. The 
definition of an abugida (ibid 4) states that in such a system ‘each character denotes 
a consonant accompanied by a specific vowel, and the other vowels are denoted by a 
consistent modification of the consonant symbols’. It is not clear that consistent mod¬ 
ification is intended to be broad enough to include the orientational distinctions 
central to the Evans system. It is at least somewhat different in that there is no obvi¬ 
ously unmarked variety, such as the characters for consonant plus short /a/ in San¬ 
skrit Devanagari. It is proposed here that Daniels’ otherwise very useful typology 8 
be modified so as to allow an abugida to be either diacritic (like the Ethiopic and 
Devanagari and other Indian systems), or orientational (like the Evans system) 9 . 

The general graphemic analysis of an orientational abugida such as is found in 
those writing systems based on the Evans script should, it would seem, distinguish 
graphemes of basic shape, which symbolize the consonants, from the orientational 
factors which distinguish the vowels, so each form associated with a consonant + 
vowel syllable would be seen as simultaneously embodying a grapheme of shape and 
another of orientation. Graphemically, this is quite different from a true syllabary, 
like the Cherokee, whose typical graphemes represent particular consonant + vowel 
sequences as wholes. It is also different from an alphabet, which has graphemes of dif¬ 
ferent shapes for consonants and vowels. Another factor which must be considered in 
relation to the Evans script is the symbolization of consonants that stand in syllable- 
final position, not followed by any vowel. Generally, the device adopted for these is a 
symbol of reduced size and suprascripted position placed to the right of the character 
representing the onset and coda. In the forms of the script used for Inuit languages 
and for the more easterly Algonquian languages, these suprascript consonants share 
the same shapes as the non-suprascripted consonantal form, and are placed in the 
orientation that would indicate a following /a/ in a non-suprascripted occurrence. 
It seems reasonable in such a case to abstract suprascripting as another graphemic 
property at work here, symbolizing the lack of a following vowel. (This compares 
functionally to the usage in Sanskrit Devanagari, where an explicit stroke [termed 
the virama] indicates that no vowel is to be pronounced after the consonant.) This 
would also allow us to consider the orientation associated with /a/ or no vowel to be 
the unmarked one, suggesting that only the orientations associated with other vowels 
require additional simultaneous orientational graphemes. An analysis for a represen¬ 
tative portion of the Inuktitut version along these lines is presented in Table 2. 
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Table 2. Some Inuktitut uses of the Evans script with graphemic analysis. 
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Tables. Varieties of the Evans script in representing syllable-final consonants in 
Algonquian languages. (Based on Nichols 1996) 

When we consider the versions of the Evans script in use for more westerly Algon¬ 
quian languages (generally Cree dialects spoken to the west of James Bay), however, 
we find a situation that is roughly analogous, but is different in one crucial detail: 
the shapes found in suprascripted position for a particular consonant are not relat- 
able in any obvious way to those indicating corresponding onsets, as illustrated in 
Table 3 10 . This could be treated as a matter of allography, with special shapes occur¬ 
ring when accompied by the suprascripting grapheme, just as Roman, Greek, or Cyril¬ 
lic letters accompanied by a grapheme of capitalization may assume different shapes, 
sometimes without an obvious similarity to their other shapes. There is, however, a 
crucial question affecting the acceptability of such an analysis. It would be acceptable 
if we can consider aspects of the function of a grapheme in relation to phonemes (or 
other units of a spoken language) in deciding between logically possible graphemic 
analyses, but not if we are prohibited from doing so, in the way a phonologist surely 
ought to be prohibited from deciding between alternate phonemic analyses on the 
basis of any writing system that might be in use for the language. 
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Name 

Denotation of characters 

Examples 

LOGOSYLLABARY 

Individual words (or morphemes) as well 
as particular syllables 

Chinese 

Sumerian 

SYLLABARY 

Particular syllables (with no systematic 
graphic similarity between characters for 
phonetically similar syllables) 

Linear B 

Cherokee 

Kana 

CONSONANTARY 

(or abjab) 

Consonants only 

Phonecian 

Hebrew 

ALPHABET 

Consonants and vowels 

Greek 

Roman 

Cyrillic 

abugida: 

Consonant accompanied by a specific vowel, 
with other vowels denoted by a consistent 
modification of the consonant symbols 

Ethiopic 

Devanagari 

(Evans?) 

FEATURAL SYSTEM 

Distinctive features of segments 

Han’gul 

Pitman shorthand 


Table 4. Types of writing by units represented according to Daniels 1990,1995. 


It seems better to allow the consideration of the reference of a graph, then, when it 
helps us to decide a case like this. One alternative would be not to associate the coda- 
consonant graphs with the others at all, and the other would be to associate them in 
a totally arbitrary way. 

2.2. SECOND PROBLEM: GENERAL APPLICATION OF THE GRAPHEME CONCEPT. An 

advantage to the general adoption of the analogical rather than the referential view of 
the grapheme is that it allows the idea of the grapheme to be used with any type of 
writing system, whether or not it has reference to the phonology of the correspond¬ 
ing spoken language. With this broadening, however, come problems of how to apply 
the notion in writing systems which relate to spoken languages in various ways. Dan¬ 
iels’ typology of these relationships (summarized in Table 4) marks a definite advance 
over the traditional one. When it is clarified to allow both diacritic and orientational 
varieties of abugida, it gives six different varieties of phonographic writing, plus the 
additional category for the Chinese type, which Daniels terms a logosyllabary 11 . 

In virtually any writing system, however, there is a hierarchical arrangement at 
least roughly parallel to hierarchies seen in phonology and other strata of a spoken 
language. In the Roman alphabet we have, for instance, written words composed of 
letters, and letters composed of various kinds of loops, curls, bars, and the exten¬ 
sions termed ‘ascenders’ and ‘descenders’ by printers. Accounts of the Chinese writing 
system (e.g., Coulmas 1989:95-96) tell us that full characters are composed of one or 
more base characters, and the Chinese tradition gives us a further analysis of the base 
characters into strokes of just eight basic kinds. The problem is deciding just where in 
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Name 

Denotation of primary graphemes 

SYLLABARY 

Syllables: usually a CV combination, but also a vowel alone; 
or a more complex syllable. 

CONSONANTARY 

(abjad) 

Consonants. 

ALPHABET 

Consonants and vowels 

abugida: 

DIACRITIC 

Consonant plus a vowel treated as unmarked in the writing 
system, supplemented by diacritics indicating marked vowels or 
the absence of a vowel. 

abugida: 

ORIENTATIONAL 

Consonant (or the absence of a prevocalic C), but typically 
combined with graphemes realized in terms of orientation 
specifying the vowel accompanying (or its absence). 

FEATURAL SYSTEM 

Properties of segments, combinable with one another in specified 
ways to indicate a whole segment, or with secondary graphemes 
manifested by more abstract properties such a repetition or 
shading. 


Table 5. Types of phonographic writing as interpreted in terms of primary graphemes. 

such a hierarchy to apply the grapheme concept. Are the graphemes of English writ¬ 
ten words, letters, or the subparts of letters? Are those of written Chinese full charac¬ 
ters, base characters, or strokes? 

It is suggested here that this problem can be solved if we consider the relations 
between written and spoken language to some extent. Specifically, it is proposed that 
each writing system has a set of primary graphemes, definable on the minimum hierar¬ 
chical level that bears a systematic relation to the spoken language. Table 5 shows how 
this applies to the various kinds of phonographic writing, recognizing that there will 
also be other kinds of graphemes, including suprasegmental and diacritic ones. 

For the Chinese type, it appears that the base character would be the most obvi¬ 
ous candidate for the status of primary grapheme. Strokes would be clearly excluded 
due to their lack of relation to the spoken language (any exceptions found being treat¬ 
able as diacritic graphemes). Of the remaining candidates, the base character seems 
to be the best choice because it does typically relate to the spoken language, though 
this relation can be quite complex, sometimes involving a graphic kind of idiomatic- 
ity. Some writing systems, such as that of Japanese, are compound in that they com¬ 
bine distinct sets of primary graphemes of different types in one overall system. So 
the Kanji part of Japanese writing would be subject to essentially the same analysis 
as Chinese, but the Kana (Katakana/Hiragana) would have primary graphemes of a 
syllabic type. 

3. conclusion. In summary, it is proposed that the proper way to look at writing 
systems linguistically involves adoption of the analogical view in preference to the 
referential one, with the proviso that phonology may be considered for two purposes: 











314 


DAVID G. LOCKWOOD 


(1) classification of graphs as coallographs in the absence of graphic similarity, and 

(2) determination of the hierarchical level appropriate for the primary graphemes of 
a written language. 

This view may profitably be compared to positions presented in previous lacus 
Fora by two experienced specialists in the study of writing, Peter Daniels and Earl 
Herrick. Daniels (1992) began the discussion by arguing against a structural graphe- 
mics, while Herrick (1995a) replied in support of the concept. From a reading of their 
positions, however, it seems evident that they were arguing in relation to different 
conceptions. Daniels fundamentally argued against the referential grapheme, while 
Herrick argued in favor of the analogical view. Both agreed, in other words, on the 
infeasibility of the referential conception. Herrick presented the view, however, that 
a structural graphemics based on the analogical conception was indeed feasible. In a 
reply to this, Daniels emphasized that he did not view writing as a form of language, 
so he as a linguist was interested almost exclusively in the study of writing in relation 
to speech. Herrick’s conception of a structural grapheme based on the internal rela¬ 
tions of writing was therefore of little interest to him. 

The view presented in this paper is closer to that of Herrick in seeing a structural 
concept of the grapheme based on the analogical view as a valuable one. At the same 
time, it recognizes the importance of studying the relations between written and 
spoken forms of language, and finds some consideration of these relations of refer¬ 
ence to be valuable in solving some otherwise difficult problems of analysis. 

While the proposal to depart from a strict analogical view to allow a limited con¬ 
sideration of reference for special purposes does in fact interrupt the strict parallelism 
with phonology, it does not seem so unreasonable when we consider the asymmetry 
that exists between phonology and graphonomy. While every spoken language has a 
phonology by definition, only some have an associated orthography. While the idea 
of a written language not associated with a spoken language does not seems theoreti¬ 
cally unimaginable, it occurs in practice only in unusual cases, such as when we learn 
to understand an ancient written language we cannot pronounce, or when a deaf 
person unable to control the spoken form of a language still learns to read and write. 
In view of this practical asymmetry, the methodological asymmetries suggested here 
as appropriate do not seem unreasonable. Furthermore, this proposal is not made 
arbitrarily, but as a solution to genuine analytical problems. 


Herrick i995a/b suggests the term ‘graphemic grapheme’ for a grapheme according to the 
analogical view vs. ‘phonological-fit grapheme’ for one conceived according to the refer¬ 
ential view. Herrick admits these terms are somewhat ad-hoc, and the first also seems 
to prejudge any comparison between them, suggesting that graphemes of the sort Her¬ 
rick favors are ‘the genuine article’. Kohrt’s terms are therefore preferred in any discussion 
comparing these views. 

In the chapter on Writing and Language in Anttila 1972 the notion of graphemics sug¬ 
gested is explicitly restricted to the study of alphabetic writing (33). This statement can be 
puzzling to a reader whose acquaintance with the grapheme is based primarily on the two 
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chapters on writing in Gleason 1961. Classroom use of Anttila’s book as the primary text 
in teaching historical linguistics always required the present writer to comment about this 
restricted view of the grapheme. It now seems apparent that Anttila had been influenced 
strongly by the use of grapheme/allograph in the Stockwell and Barritt referential tradi¬ 
tion. Indeed the historical linguist would be primarily interested in writing as evidence for 
the phonology of ancient languages, and so would have a great interest in the referential 
relations, but this would not seem to justify adoption of that view overall. 

3 Swadesh, of course, spoke of complementary distribution’. The use of the amended term 
reflects refinements made later, specifically in Hockett 1947. The discussion there con¬ 
cerned morphemic rather than phonemic analysis, but the same general principle also 
came to be applied in phonology. 

4 The use of capitals characterizes most modern alphabets derived from the Greek, though 
it was not present in the older forms of Greek or Roman. Other scripts and script-variants 
showing this usage are Armemian, Coptic, some forms of Irish, and Fraktur (the so-called 
‘black letter’ form of Roman common in German through the first half of the 20th Cen¬ 
tury). See Herrick 1999 for examples of an unsuccessful 1930s proposal to extend capi¬ 
talization and italicization into Hebrew, intended as a way to modernize that writing 
system. 

5 Obviously the resemblances are matters of degree. These divisions are based on the 
author’s best judgement. 

6 Cyrillic shows a fairly striking difference from its alphabetic relatives in this regard. This 
lack of difference makes printed Cyrillic resemble Roman printing that uses small capitals 
and precludes the use of small capitals for special uses in Cyrillic. There are greater differ¬ 
ences, however, when handwritten rather than printed Cyrillic forms are considered. 

7 In 1999 the Canadian government accorded special recognition to this writing system in 
connection with the establishment of the Inuit territory Nunavut, carved from a portion 
of the Northwest Territories. Both a special 46-cent stamp and a special $2 coin issued in 
commemoration of this occasion contained the name of the territory in both Roman and 
the Evans script. 

8 As Herrick 1995a points out, this typology is based on the fit of the writing system 
with the spoken language, rather than on internal properties of the writing system itself. 
This should not disparage its general usefulness, however, in organizing accounts of the 
written/spoken relations on which it focuses. 

9 Some uses of the Evans system combine certain diacritics with orientational differences 
to accommodate richer sets of vowel distinctions. A raised dot, for instance, is used in the 
Inuktitut form to indicate that the vowel involved is long rather than short (Nichols 1996). 
So this overall system has features of both the orientational and the diacritic subtypes 
of abugidas. Indeed the introduction of such diacritics would seem to be the only way 
to devise a referentially adequate orientational abugida for a language with more than 4 
distinct vowels. This is relatively easy when there are sets of vowels distinguished by such 
properties as length, height, frontness, rounding, tongue advancement, or nasalization. 

10 In principle, the shapes the Western Algonquian coda allographs appear to derive from 
Pitman shorthand, though not all details of usage are identical. For instance, both systems 
use short dashed lines in different orientations to symbolize stops, but with differences of 
reference. 

11 This terminology shows the influence of the idea promoted in DeFrancis 1989 treating 
Chinese writing as a rather complex, morphologically influenced sort of syllabary rather 
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than a logographic or morphographic system as has been supposed more traditionally. 
For a contrasting argument, see Sampson 1994. 
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DERIVING OT CONSTRAINTS FROM RELATIONAL NETWORK THEORY 


William J. Sullivan 

University of Florida and Uniwersytet Wroclawski 


introduction. ‘Generative’ phonology 1 , so called because it was intended as a pho¬ 
nological module compatible with the transformational-generative syntax introduced 
in Chomsky 1957, was first incarnated in Halle 1959. Halle 1959, based on a part of 
Jakobson, Cherry & Halle 1953, is limited to the production of‘ideal phonetic forms’ 
of the chains of lexical items that make up the output of a Syntactic Structures model 
of syntax. During the 60s generative phonology grew more complex, resulting in 
the kind of description provided in Chomsky & Halle 1968. By the mid 70’s other 
descriptive models began to be introduced in order to account for phonological phe¬ 
nomena that were as yet unexplained in the generative family of theories (hence¬ 
forth C-phon) 2 . Among these later models are autosegmental phonology, metrical 
phonology, and CV phonology. Yet problems remained, many of them 3 artifacts of 
Chomskyan theory. Some post-1985 C-phon theories have had good work published 
in them. Underspecification (US) faces a question first asked in Jakobson, Cherry & 
Halle 1953, i.e. what is the minimal number of features needed to specify the pho¬ 
nemes of a language 4 . Feature Geometry (FG) faces questions also asked there, e.g., 
what is the principled, structural relation between features? Sullivan 1998 shows that 
US and FG, which are distinct theories in the C-phon family, are implicit in and 
derivable from a well-developed relational network (RN) theory. That is, they are the¬ 
orems of an RN theory. 

An even more recent theory in the C-phon family is Optimality Theory (OT 5 ). 
Again good work has been done in this theory. Milnes and Wiltshire 2000 (hence¬ 
forth MW) provide an OT description that predicts stress in certain Russian para¬ 
digms and Gavrin 1999 provides an OT description of coda position in the Korean 
syllable. Sullivan (ms.) describes the same Russian problems in a generalized RN 
description, executed in a logic-based model of cognitive-stratificational theory, and 
the present study describes the Korean problems attacked in Gavrin 1999. Taken 
together, the two articles demonstrate that the OT constraints of MW and of Gavrin 
1999 are implicit in the RN description. This indicates that OT, too, is a theorem of an 
RN theory. 

1. rn descriptions. Cognitive-stratificational theory is not the oldest RN theory but 
it is the best developed. The networks described are three-dimensional and consist 
of lines connected at nodes. The nodes may be defined in neurological terms, as in 
Lamb 1999. However, I am using a coarser model, as my main interest is in the logic of 
the architecture that accounts for these examples. In this model the nodes are defined 
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a 



b 


Figure 1. Logical relations of a phoneme. 

in terms of the logical nature of the relationship (and, or, precedence). Lines that 
appear to cross without a node are actually two-dimensional projections of lines 
that are skew in three-dimensional space. The triangle shows an and relation, the 
square bracket an or. Spacing between lines on the plural side of a node indicates pre¬ 
cedence in either time (with and) or choice (with or). Here the left-hand line takes 
precedence in order (with and) or markedness (with or). A diamond node is a short¬ 
hand for a combination of or relations that intersect the semiotic (form-function) 
plane with a structural (tactic) plane, here the phonology. A circle at the end of a line 
(see figure 2) indicates zero realization; a circle on a line indicates optional occurrence 
(alternation of what comes below with zero). 

I begin the description with the logic of a phoneme. The graphic description is 
given in Figure 1, which is a maximum potential set of relations for a phoneme. The 
phoneme itself is represented by the little square box in the center of Figure 1, gener¬ 
ally called a diamond. Lines a-b are related to points in the structural (phonotactic) 
part of phonology. Line a relates the phoneme to larger (i.e., hierarchically domi¬ 
nant) structures. These structures define its range. Line b relates it to smaller (hier¬ 
archically subordinate) structures, which define its domain. Conversely, lines c-d are 
in the semiotic (form-function or realizational) part of phonology. Line c is related to 
the set of morphemes that this particular phoneme helps to distinguish from other 
morphemes, and line d is related to its phonetic correlates, initially to the phonemic 
feature(s). None of the Korean phonemes considered below have more than three 
of the four relations, but it is easy to exemplify all four, using phonemic stress. With 
stress, line a relates to Pword and line b to syllable. The stress phoneme is related to 
the phonetic correlates of stress (pitch shift, syllable length, loudness, etc.) via line d. 
Finally, line c relates it to those morphemes distinctively marked for stress. 

2. ot constraints in Korean phonology. The OT constraints relevant to Korean 
phonology and their relative ranking, as described in Gavrin 1999, can be predicted 
from a generalized RN description of the Korean syllable. 6 The structural and real¬ 
izational facts are straight-forward. I begin with a summary of syllable structure in 
Korean. 

The Korean syllable permits four canonical forms: V, CV, VC, and CVC. A VCV 
sequence is syllabified [V CV], where the space indicates the transition between 
syllables. This shows that onset position of syllable i+i takes precedence over coda 
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Gloss 

Maximal form 

Phonetic form 

price’ 

kaps 

kap 

‘soul’ 

naks 

nok 

Tst birthday’ 

tols 

tol 

clothing’ 

OS 

ot 

‘to sit’ 

anc ta 

an fa 

‘to tread on’ 

palp ta 

pap t’a 

‘to recite’ 

ilp h ta 

ip fa 

‘to read’ 

ilk ta 

ik fa 

‘to starve’ 

sulm ta 

sum t’a 


Table 1. Coda alternations in Korean. 

position of syllable i. Only seven consonants are permitted in coda position: unre¬ 
leased oral stops /p t k/, nasals /m n q/, and the apical liquid realized as [l] 7 . 

The focus of Gavrins examples is on what happens with morpheme- and 
syllable-final consonants. They are realized in onset position, when followed by a 
morpheme that begins with a vowel, e.g. -un ‘topic marker. They are realized in coda 
position in word-final position or when followed by a morpheme that begins with 
a consonant, e.g. -ta ‘infinitive suffix’. In onset position the apical liquid is realized 
as tapped [r] and other consonants are realized in their phonemic (or underlying) 
forms. In coda position the consonants have alternate realizations. Some are neutral¬ 
ized with one of the seven permitted consonants, others are realized as zero. 

A set of examples taken from Gavrin 1999, representing her native dialect 8 , illus¬ 
trate these alternations. The examples are summarized in Table 1. Recall that all coda 
stops are unreleased (not indicated in the transcription). C' is a fortis articulation. A 
space in the ‘maximal form indicates two morphemes, a space in the phonetic form 
indicates two syllables. 

Gavrins OT description invokes the constraints in (1), ranked as given (a-e). 

(1) a. ^complex excludes syllable-internal consonant clusters 

b. parse realizes all underlying segments 9 

c. ALiGN-a aligns the right 10 edge of the stem with the right edge of the 
syllable 

d. peripherality ‘parses’ (gives prosodic priority to) labials and velars at 
the expense of apicodentals 

e. coda sonority parses more sonorous segments at the expense of less 
sonorous ones 

As with the ranked constraints in MW’s description of Russian accent, the con¬ 
straints in (1) produce the correct outputs (or, more properly, weed out the worse 
output forms). 
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Syllable 



Closure 

Nasal 


Apical 

(+Closure) 

Lateral 

Tap 


Apical 


3. a rn description of Korean codas. The portion of Korean phonology critical to 
a description of the alternations in Table 1 is given in Figure 2. 

Arranged vertically along the left-hand side of the diagram are the relations 
coming from what might be called the spell-out rules of morphemes. In this case I 
include three obstruents and the resonant that is realized as either a tapped [r] or an 
[ 1 ]. Reading vertically from the top, I include the relevant portion of the syllable struc¬ 
ture, specifically the relations of various phonemes to coda position of the syllable. 

Consider the examples in Table 1, focusing on the realizations of underlying c, t, s, 
and the apical resonant. In onset position c is realized as [c], but it is realized as zero 
in coda position, e.g., [an t'a] ‘to sit’ from anc ta. In all cases t is realized as [t]. More 
varied realizations are found for s: as [s] in onset position (indicated, not shown), as 
[t] in [ot] ‘clothing’, from os, and as zero in [kap] ‘price’, from kaps. The apical reso¬ 
nant has equally varied realizations: as a tapped [r] in onset position (only partially 
shown), as [1] in coda position, e.g. [tol] ‘first birthday’, from tols, and as zero in [ik t’a] 
‘to read’, from ilk ta. These facts are represented by the horizontal relations in Figure 
2, reading from the top to the bottom. 
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Now return to the structural relations. A generalized statement of the canonical 
form of the syllable in Korean is [C] V[C] (an optional consonant followed by a vowel 
followed by an optional consonant). 11 Coda position in Korean accepts only unre¬ 
leased stops, nasals, and the [ 1 ] allophone of the apical resonant. Most liquid-nasal 
and liquid-stop sequences are realized without the liquid, shown by the ordering at 
(2d). But if the liquid is followed by an apical stop, whether from a t-morpheme or 
an s-morpheme, the sequence is realized without the stop, shown by the ordering at 
(2e). These are the vertical relations in Figure 2. The bottom-most diamonds are the 
phonemes that intersect the realizational and tactic relations. 

4. discussion. The RN diagram predicts both the OT constraints and their ranking. 
^complex (2a on Figure 2) is inherent in the structure of the and node labeled‘syl¬ 
lable’: both onset and coda positions are optionally related to a single consonant pho¬ 
neme. Anything more is ‘squeezed out’ structurally and either is neutralized with 
something else or is realized as zero. 

parse is inherent in the realizational relations (2b on Figure 2). To see this, con¬ 
sider the logic of an or relation like that specifying the realization of the liquid apical 
in Figure 2. The zero realization is always available. The apical tap [r] is realized in 
onset position, because the phonotactics supplies the instruction to tap the apex (Tap: 
from the left-hand diamond related to the upward or). If no other coda-position con¬ 
sonant (i.e., /pkmn q/) is realized there, the apical lateral [1] is, because the pho¬ 
notactics supplies the instruction to provide lateral articulation (Lateral: from the 
right-hand diamond related to the upward or) in addition to the apical place of artic¬ 
ulation. If another coda-position consonant is available, that consonant takes prece¬ 
dence over the apical liquid. Then the other consonant is realized and the apical liquid 
is realized as zero. Similarly the [c] is realized if it is in syllable onset position. Other¬ 
wise it is realized as zero. There is no way to prevent the overt realization of one of 
these phonemes without adding relations to the diagram, and there is no indication 
that such additional relations are needed or can be justified. The logic of the network, 
simple as it is, provides the actual realizations automatically during syllabification. 

Now in the OT description, coda sonority parses (realizes) the n in anc ta ‘to 
sit’ and ^complex ensures that the c is not realized at all. But the mere occurrence of 
nasals and not c in coda position and the possible zero realization of c has the effect 
of combining the two constraints. 

In Figure 2 the ordered or (2d) ensures that the m and not the l is realized in [sam] 
‘life’. This effect is achieved in the OT description by peripherality. peripheral- 
ity is a constraint concerning place of articulation, whereas figure 2 shows an order¬ 
ing based partly on manner of articulation (Nasal) and partly on place of articulation 
(Apical). A complete and fully generalized description, one that includes onset posi¬ 
tion, would show the exact extent to which place and manner interact in Korean 
phonology. Limitations on space and on the author’s command of Korean phonol¬ 
ogy preclude this extension. Still, the effect of peripherality also emerges from this 
partially-generalized RN description. 
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Excepting only align-o, to which I return below, the constraints cited in Gavrin 
1999 all emerge from the RN description in Figure 2. The ranking of the constraints 
also emerges from Figure 2. "complex and parse are hierarchically dominant over all 
other constraints in Figure 2. In one sense they are logically equivalent, since the output 
is an intersection of the two sets. But ’''complex is a part of the structure of the syllable 
(2a). The syllable, in turn, provides the necessary conditioning environment for the 
realizations of the various phonemes. In fact, it governs those realizations. Thus "com¬ 
plex, as an expression of a part of syllable structure, takes precedence in rank over 
parse. The ordered or relations (2d and 2e) provide the relative ranking of peripher- 
ality over coda sonority. They are subordinate to the syllable, and hence to "com¬ 
plex. Moreover, these relations (and thus peripherality and coda sonority) only 
apply if the phonemes to which the or nodes are related are activated in the horizontal 
plane. Thus they are also subordinate to parse. This gives the ranking in (2). 

(2) COMPLEX » PARSE » PERIPHERALITY » CODA SONORITY 

Return now to align-o, which aligns the right edge of the stem with the right edge 
of the syllable. In the examples given in Table 1 (and in Gavrin 1999), this is an accu¬ 
rate requirement. This constraint is not part of any of the structural considerations 
that went into Figure 2. Properly speaking, it does not actually emerge from Figure 2. 
Rather, it is an automatic consequence of two facts: the general syllable structure in 
Figure 2 ([C] V[C]) and the fact that stems in the examples occur either without suffix 
morphemes or with suffix morphemes that begin with a consonant. It is, in short, an 
emergent constraint for a restricted set of examples. 

At the same time, align-o is at least incomplete. If a suffix morpheme begins with 
a vowel, e.g., -un ‘topic marker’, Figure 2 realizes kaps un ‘price-TOPic’ as [kap sun]. 
That is, the stem-final consonant is realized as [s] in onset position of the second syl¬ 
lable. Now the extension of Figure 2 that permits the realization of s in onset posi¬ 
tion provides both realizations, i.e., as [kap] if there is no suffix, and as [kap sun], 
as remarked above. In sum, the RN description predicts the correct syllabification as 
well as the alternate realizations of the morphemes simultaneously. If I understand it 
correctly, an OT description needs syllabification rules in addition to the constraints. 
Therefore, an RN description incorporates and supersedes align-o. 

Thus, all of the OT constraints invoked in Gavrin 1999 and their relative ranking 
emerge from a generalized RN description. 

5. regular variation between dialects. Gavrin (19997-8) cites dialects in which 
underlying ilk ta ‘to read’ is realized as [il fa] instead of as [ik t’a]. She accounts for this 
form by ranking coda sonority above peripherality. The RN description of these 
dialects is simple. The diagram in Figure 2 requires modification of the ordered or 
nodes labeled (2d) and (2e). That is, the RN description recognizes that two closely- 
related dialects may differ phonologically by slight differences in the structure of 
the syllable 12 . This is a further indication that the ranking between OT constraints 
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is a direct consequence of phonological structure, as the RN descriptions show. It 
also explains why the ranking may differ so radically from language to language. The 
structure of the syllable and Pword differ from language to language, so anything 
derivable from their structure must also differ from language to language. 

This brings us to a consideration of the more general theoretical consequences of 
OT’s findings and RN description. 

6. theoretical consequences. In this admittedly restricted but not artificial prob¬ 
lem, both the OT constraints and their ranking emerge from a generalized RN 
description that is focused on more fundamental linguistic considerations, e.g., sylla¬ 
ble and morphological structure. This means that Gavrins description of Korean con¬ 
sonant alternation is actually derivable from the RN description. Sullivan (ms.) shows 
that this is also true for a radically different problem in an unrelated language (accen- 
tology in Russian), as described in MW. I suspect that the same would be true in every 
case. An actual proof requires the examination of all proposed OT constraints and the 
relative rankings in the context of particular languages, but we can see some general 
indications now. 

First, OT constraints are of two kinds: universal 13 and specific to a language (or 
language family). Second, what differs most critically from language to language is 
the ranking between constraints. As remarked above, OT produces the correct output 
for the Russian examples in MW and the Korean examples, just as the cognitive-strat- 
ificational description does. There is, however, a clear advantage to the RN descrip¬ 
tion in the two cases. For the Russian case, it predicts the correct outcome without 
explicitly distinguishing between two types of M-stress and establishing a hierarchy 
between them. More generally, from an O-theoretic point of view, it also shows that 
the constraint parse in Gavrin 1999 and the two constraints stem and affix in MW 
are logically identical: a phoneme that alternates with zero. 

But this isn’t all. Whether OT is a basic theory standing on its own or a sub-theory 
of C-phon, the existence of the constraints and their ranking are both unexplained 
‘facts’ or theoretical postulates. Conversely, if OT constraints and their relative rank¬ 
ing emerge from an RN description in every case, then OT is actually derivable as a 
theorem or set of theorems of a fully-developed RN theory. Then the constraints and 
their ranking follow automatically, as I show in the present case 14 . Moreover, it is an 
inescapable conclusion that the postulates underlying RN theory are at a much more 
fundamental level than those underlying OT. Certainly the consequences of these 
observations relative to the universality of the constraints suggest that the relation¬ 
ship between presumed universals and individual languages requires deeper consid¬ 
eration in the light of relational networks. 

In short, it is long past time to turn away from the dead ends that ‘mainstream’ 
(Chomskyan) approaches have led to. We need to look seriously at methods of 
research that use the tools of hard science, neurology among them, and at RN 
approaches to description. Perhaps then the promise that everyone expected of lin¬ 
guistics a generation ago may at last start to be realized. 
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It is misnamed. It has always had transformations, initially called Prules, but it never had 
a generative component, as Sampson 1970 points out. 

Nowadays C-phon of the first 15-20 years is often called, rather condescendingly, ‘linear’ 
phonology, because more recent models incorporate hierarchical, non-linear features like 
autosegments (prosodies) and syllables. More recent C-phonologists forget the work of 
Trubetzkoy, Firth, Pike, Haugen, Hockett, and a host of other phonologists, not the least of 
whom was Jakobson himself. That is, the C-phonologists merely re-discovered the wheels 
that had powered phonological research for decades and had been abandoned after 1959. 
For example, the prohibition against crossed lines, effectively restricting these models to 
two-dimensional status. 

That is, the minimum number total and the minimum number per phoneme. 

Cf. Archangeli & Langendoen 1997 .1 want to thank Jeong-a Ahn Gavrin of the University 
of Florida for extended discussions on OT and the chance to read her paper on Korean. 
I also want to thank Irina Milnes and Caroline Wiltshire of the University of Florida for 
similar discussions and the opportunity to read preliminary drafts of their article applying 
OT to Russian stress. Any errors or misinterpretations are my responsibility. 

The examples and the OT description are taken from Gavrin 1999 .1 am dealing with only 
that portion of her description related to coda ‘simplification’ in the dialect of Korean that 
Mrs. Gavrin speaks natively. As a result, the present study does not contain sufficient con¬ 
text to justify its details. Because of space limitations, I present only this portion for the 
sake of illustration. 

When in onset position it is realized as tapped [r]. It has no ‘underlying form’ because the 
term ‘underlying form’ has no meaning in an RN approach. I use ? as a convenient symbol 
for it in Figure 2. 

I return to the question of dialect variation (Gavrin 1999:7) below. 

This constraint subsumes the same functions as stem and affix in MW. 

Although this constraint is found in many OT papers, it is, strictly speaking, inaccurate, or 
at least incomplete, sometimes begging the question of‘resyllabification’. In any event, the 
actual constraint in this case emerges from the RN description, like all the others. I explain 
further below. 

I am ignoring the fact that the onset of a following syllable takes precedence over the coda. 
It is easily accounted for in an RN description, but it plays no role in the present study. 
Compare the pronunciation of English ask in those dialects that neutralize it with axe. If 
this is a general phenomenon in those dialects, the most straight-forward way of account¬ 
ing for it is to ascribe it to the structure of coda position of the syllable. 

It is unlikely that any OT constraint is truly universal, unless it reflects human biology, 
physiology, or neurology or is based on general logical principles, all of which are non- 
linguistic. The question of possible universality remains to be determined empirically, but 
it is irrelevant here. 

I am told that C-lin has a number of‘hot’ issues (actually unsolved problems) right now, 
including the source of linear order in texts, the nature of lexical ‘features’, the source of 
pronouns, the mechanism of agreement, etc. Answers to all of these problems emerge 
from an RN approach that needn’t even be fully generalized. 
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SYLLABIFICATION IN PARALLEL IN SPANISH 


Caroline R. Wiltshire 
University of Florida, Gainesville 


syllabification and syllable-sensitive alternations in Spanish have long been 
used as an argument for derivational approaches, in which syllabifications and rules 
apply in sequence (Harris 1983,1993, Hualde 1992, Kaisse 1999). For example, a rule that 
realizes Is/ as [h] is said to apply in some dialects in coda position, if words are syllabi¬ 
fied both before and after the rule, as in (1), desigual ‘unequal’ and las osas ‘the bears’ 


(1) a la Harris (1993) 
output of first 
syllabification 
s-aspiration in coda 
resyllabification 


/des + igual/ 
(des)(i)(gwal) 


(de[h])(i)(gwal) 

(de)(hi)(gwal) 


/las # osa+s/ 

(las) (o)(sas) 

Qa[h]) (o)(sa[h]) 
(la) (ho) (sah) 


Reanalyses in Optimality Theory (Kenstowicz 1996, Colina 1997, Face 1999) avoid 
resyllabification by using constraints that directly compare output forms, but the use 
of such constraints is a continuing source of controversy (Hale, Kissock, and Reiss 
1998). I propose to analyze Spanish syllabification using an OT approach that relies 
instead on constraints that preserve the integrity of prosodic constituents. I show 
how syllabification within and across words can be accomplished in a single step, by 
the parallel evaluation of such constraints, and handle segmental alternations such 
as s-aspiration in terms of surface phonotactics, rather than as processes ordered 
between levels of syllabification. I begin by reviewing the syllabification data in Sec¬ 
tion 1, followed by my proposal in 2. In 3 ,1 briefly contrast my approach with previ¬ 
ous analyses, both rule-based and OT, and conclude with advantages of the approach 
described here. 


1. data. Syllabification in Spanish tends to obey two phonological pressures found 
widely cross-linguistically: syllables prefer onsets over codas, and syllable edges tend 
to align with word edges. Word-internally, where there is no pressure to align with 
word edges, the preference for onsets over codas results in the syllabification of single 
intervocalic consonants in onset position (2)a-b, and the maximization of onset clus¬ 
ters, where possible by the phonotactics (2)c-d. 


(2) 

a. /asules/ 

[.a.su.les.] 

*[. as.ul.es.] 

‘blue’ 


b. /komida/ 

[.ko.mi.Sa.] 

*[.kom.i.8a.] 

‘food’ 

(2) 

c. /ablar/ 

[.a.JMar.] 

*[.aj 3 . 1 ar.] 

‘talk’ 
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(2) d. /peregrino/ [.pe.re.Yri.no.] *[.pe.re Y-r i.no. 1 ‘pilgrim’ 

The two pressures conflict, however, when a word-final consonant precedes a word- 
initial vowel. In that case, a word-final consonant is syllabified into an onset with the 
following vowel, as in (3). 


(3) a. /grandes#oxos#asules/ 

b. /asul#oskuro/ 

c. /klub#elegante/ 

d. /kalor#insoportable/ 


[ .gran. de. so.xo. sa- su.les. ] 

l.a.su.los.ku.ro.l 

[.klu.J 3 e.le.Yan.te.] 

[ .ka.lo. rin. so.por. ta. ( 31 e. ] 


‘big blue eyes’ 
‘dark blue’ 
‘elegant club’ 
‘unbearable heat’ 


If the following word is consonant-initial, so that it already has an onset, syllabification 
aligns with the word boundaries, and any word-final consonant is in the coda, (4). 


(4) a. /klub#lindo/ [,klu( 3 . 1 in.do.l *[klu.J 31 in.do.] 

b. /cef#loko/ [.ceflo.ko.] *[ce.flo.ko.] 

c. /benid#rapido/ [.be.niSra.pi.do.] *[be.ni.Sra.pi.do.] 


‘beautiful club’ 
‘crazy chef’ 
‘come quickly’ 


This gives the appearance of different rules of syllabification in phrases than word- 
internally, since a cluster such as /bl/ is treated as a good onset within a word 
([.a.J 31 ar.]), but as a coda plus onset in a phrase ([.kluJ 3 Jin.do.]). However, the gener¬ 
alization is that word edges coincide with syllable edges unless a syllable would lack 
an onset. The same generalization holds across prefix-edges. Unless a syllable would 
lack an onset (5)a-b, prefix edges coincide with syllable edges (s)c-d. 


(5) a. /des+igual/ 

b. /sub+alterno/ 

c. /sub+lu.nar/ 

d. /ad+risar/ 


[.de.si.gwal.] 
[.su.J 3 al. ter.no.] 
[,su( 3 . 1 u.nar.l 
[.ad.ri.sar.l 


*[.des.i.gwal.] 

*[.su( 3 .a.l.ter.no.] 

*[.su.( 31 u.nar] 

*[.a.Sri.sar.] 


‘unequal’ 

‘subordinate’ 

‘sublunar’ 

‘to right (nautical)’ 


2. analysis. I first analyze syllabification within simple words and across words. I 
then extend the analysis to prefixed words, and, finally, show how the structures pro¬ 
posed for syllabification help in analyzing alternations that had been treated as rules 
serially ordered with resyllabification. 


2.1. syllabification within and across words. The analysis of syllabification in 
phrases is straightforward in an OT account. The pressure for syllables to have onsets 
is the onset constraint (6)a. The pressure for word edges to align with syllable edges 
is captured by two alignment constraints (6)b and (6)c, parametrized for the left/right 
edge of the word. Although ultimately low ranking in Spanish, a constraint against 
codas, (6)d, is included for completeness. 


(6) a. ONSET 


Syllables begin with a consonant. 
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Input /ablar/ 

ONSET 

NOCODA 

align-r(pw,o) 

align-l(pw,o) 

[.a.( 3 . 1 ar] 

x- 




[.a( 3 . 1 ar] 

x- 

*! 



[.a( 31 .ar] 

**! 

** 




Tableau 1. Word medial onset maximization 4 . 


Input /klubfelegente/ 

ONSET 

NOCODA 

align-r(pw,ct) 

align-l(pw,g) 

[klu.( 3 ] [e.le.yan.te] 


* 

X- 

X- 

[klu( 3 ] [.e.le.yan.te] 

*! 





Tableau 2. Phrase-medial misalignment to provide onset. 


Input /klubflindo/ 

ONSET 

align-r(pw,o) 

align-l(pw,ct) 

NOCODA 

[klu.p][lin.do] 


*! 

x- 

X- 

is" Ikluft.l Ilin.dol 




x-x- 


Tableaus. Phrase-medially, no misalignment to maximze onsets. 


(6) b. align-l(Wc1 ,0) The left edge of the word and the syllable coincide. 

c. align- R(Wd,o) The right edge of the word and the syllable coincide. 

d. NoCoda Syllables end with a vowel. 

Within a word, onset and NoCoda decide the winner, and any ranking of the two 
results in maximization of the onset cluster (Tableau 1). Across word boundaries, 
a word-final consonant is syllabified with a word-initial vowel, violating the word- 
syllable alignment constraints; these are outranked by either onset or NoCoda (Tab¬ 
leau 2). As the candidate aligning word and syllable edges has more violations of 
both onset and NoCoda than the candidate that violates word-syllable alignment, we 
cannot further rank the constraints from this example. However, when a consonant- 
final word precedes a consonant-initial word, the ranking becomes clear. In these 
cases, word-syllable alignment is satisfied at the expense of a NoCoda violation (Tab¬ 
leau 3). Thus the ranking that gives us maximal onsets within words and syllabifica¬ 
tion across words only in cases in which a word would otherwise be onsetless is given 
in (7). Though the word alignment constraints are outranked, they assert themselves 
if Onset is already satisfied. 

(7) ONSET » ALIGN-R, ALIGN-L » NOCODA 

2.2 prefixation. To handle syllabification in prefixed forms, we require an analysis 
of the structure of prefixed words which is parallel in some sense to that of words in 
phrases. However, prefixes do not stand alone as words and must be kept structurally 
distinct from words. Following suggestions in Selkirk (1995) and Peperkamp (1997), 
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a. 

morpheme + suffix 

[.a.bl+a+r.] PW , [,me.s+eh.] PW 

b. 

prefix + base 

[.de.s[i.gual.]p W ] PW , [.sub.[lu.nar.]pw]pw 

c. 

words in phrases 

[.klub.] PW [lin.do] PW 


Table 1. Prosodic structure of words in Spanish. 

Prosodic Phrase (Ph) 

Prosodic Word (PW) 

a 

Figure 1. Simplified prosodic hierarchy (Selkirk 1984). 

I treat the prosodic word (PW) structure of Spanish as different for prefixed and suf¬ 
fixed words. As shown in Table 1, a suffix coheres to the base within a single PW (row 
a) with no internal prosodic boundaries. A prefix, however, generates a recursive PW 
structure, (row b), so that a PW edge intervenes between a prefix and the base. The 
base is a PW, and the combination of prefix and base is also a PW, but the prefix alone 
is not. Words in a phrase are distinct PWs, (row c). 

Peperkamp (1997) shows that prefixes are not incorporated into the base prosodic 
word, as suffixes are, based on two rules of Spanish which apply at the left edge of 
prosodic words: e-epenthesis and r-strengthening. These rules apply regardless of the 
presence of a prefix before the base, which therefore must begin its own prosodic 
word 6 . That the prefix is not an entire prosodic word itself is supported by its lack of 
stress and further motivated by variation in aspiration locations. However, the struc¬ 
ture proposed in Table i(b) violates the Strict Layer Hypothesis (Selkirk 1984), which 
is based on the Prosodic Hierarchy; a simplified version appears in Figure 1. 

The Strict Layer Hypothesis (SLH) proposes that every element lower in the hier¬ 
archy is properly dominated by an element one level higher (Selkirk 1984); for use in 
OT, this has been broken into violable subconstraints (Selkirk 1995, Peperkamp 1997). 
Selkirk (1995) also argues that the SLH places distinct conditions on lexical words, 
such as nouns and verbs, vs. non-lexical words, such as function words and prefixes. 
In alignment terms, such conditions require the edges of lexical words and prosodic 
words to align, as in Table 2(a) and (b), as well as a constraint that limits recursivity, 
(c), so that one element cannot dominate another on the same level of the hierarchy. 

The recursive structure in Table i(b) is motivated by the constraint aligning the 
left edge of lexical words to PWs (Table 2(b)); a base word is a lexical word, while a 
prefix is not. The ranking of the constraint that requires PWs to end at the end of lexi¬ 
cal words prevents prefixes and suffixes from forming PWs themselves (b). To satify 
these constraints, the constraint against recursive PWs, (c), is violated. With (a) and 
(b) ranked above (c), a recursive word structure is preferred to incorporation of a 
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a. 

ALiGN-L(Lex,PW) 

Align the left edge of every lexical word with the left 
edge of some prosodic word. 

b. 

ALiGN-L/R(PW,Lex) 8 

Align the left/right edge of every prosodic word with 
the left/right edge of some lexical word. 

c. 

NonRecursivity 

No C. dominates C., j=i. 

1 fj 


Table 2. 


Input /sub+lunar/ 

ALIGN-L(Lex,PW) 

ALIGN-L(Lex,PW) 

NONRECURSIVITY 

[sublunar] PW 

*! 

*(L) 


[sub] p^/- [lunar] p^Y 


*(L)*(R)! 


"S’ [sub [lunar] PW ] PW 


*(L) 

* 


Tableau 4. Phrase-medial misalignment to provide onset. 


Input /mes+es/ 

ALIGN-L(Lex,PW) 

ALIGN-L(Lex,PW) 

NONRECURSIVITY 

"S' [meses] PW 


*(R) 


[mes]p W [es] PW 


W(L)! 


[[mes PW ]es] PW 


*(L) 

*! 


Tableau 5. Suffixed word structure. 


Input /sub+ordinado/ 

ONSET 

align-r(pw, a) 

align-l(pw, a) 

NOCODA 

"S' [.su.blor.dinado.ll 



* 

* 

[.sub.[or.di.na.do.]] 




** 


Tableau 6. Provide onsets in complex words. 


prefix into the PW with the base (Tableau 4). Suffixes do not interfere with left edge 
alignment of a lexical word, and are incorporated into the base (Tableau 5). 

The structures in Table 1 also allow us a simple explanation of syllabification in 
the prefixes, using the constraints from 2.1. The constraint forcing the alignment 
of the left edges of words and syllables, align-l(pw,o) , accounts for the failure 
to syllabify across prefix boundaries as well. As onset outranks align-l(pw,o), a 
consonant does syllabify across a prefix (word) boundary if the following PW is 
vowel-initial, as in Tableau 6. Before consonant-initial words, where onset is already 
satisfied, alignment is preserved at the expense of a NoCoda violation, as in Tableau 
7 (overleaf). 

In addition to accounting for the parallels of syllabification across word and prefix 
boundaries, the proposed prosodic word structures enable us to correctly account 
for the various locations of Is/ aspiration and Ini velarization in different dialects, to 
which I now turn. 
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Input /sub+lu.nar/ 

ONSET 

align-r(pw, a) 

align-l(pw, ct) 

NOCODA 

[.su.J 3 jlu.nar.]] 



* 

* 

[.su( 3 .[lu.nar]] 




** 


Tableau 7. Don’t maximize onsets in complex words. 


2.3. alternations in parallel with syllabification. Spanish syllabification has 
attracted attention because the allophonics of consonants may depend on word posi¬ 
tion as well as coda position. This has led to analyses that order syllabification, allo- 
phonic rules, and resyllabification. My proposal in 2.2 provides for a surface-based 
analysis of allophonics such as s-aspiration and n-velarization, and I sketch here a 
parallel analyses of these two phenomena using constraints at the edges of PWs to 
account for alternations. In some dialects, both phenomena occur prefix- and word- 
finally, even if the /si or Ini is in onset positions ([de.hi.gual] ‘unequal, [e.rja.mo.ra] 
enamors’). First, as in (8), dialects differ on where Is/ is realized as [h], though there 
is no variation in the realization of Is/ before a consonant; only [h] appears. 

(8) Typology of aspirating varieties of Spanish from Kaisse (1998): 

All aspirating varieties: 


s —> h: morpheme-internal,_C 

stem-final,_C 

a. Buenos Aires Argentinian: 


s —> h: prefix-final,_+C 

word-final,_C 

[deh [cremada] PW ] PW 

[bicoh] PW [raros] PW , 

[des[igual] PW ] PW 

[bicos] P w[ ehtranos] P w 

b. Rio Negro Argentinian and Caribbean 

II: 

s —> h: prefix-final,_+C 

word-final,_#{C,V} 

[deh [cremada] PW ] PW , 

[bicoh] PW [raroh] P w , 

[des[igual] PW ] PW 

[bicoh] PW [ehtranoh] PW 

c. Caribbean Spanish I (Harris 1993) 


s —» h: prefix-final,_+{C,V} 

word-final,_#{C,V} 

[deh [cremada] PW ] PW , 

[bicoh] PW [raroh] PW , 

[deh [igual] P w ] PW 

[bicoh] PW [ehtranoh] PW 

Using the structures developed in 2.2, a suffix coheres to the base within a single PW, 

with no internal boundaries, so that morpheme-internal and morpheme-final Is/ act 

the same in all dialects. In a prefixed form, the PW edge which intervenes between a 


prefix-final Is/ and the base may cause aspiration. Two PW edges follow a word-final 
/s/, which may also be aspirated. Thus in all the aspirating dialects of Spanish, we need 
constraints against [s] in preconsonantal position, (9)a, and for some, the constraints 
against [s] before word edges, (9)b and c, play an active role: 

(9) a. *sC No pre-consonantal alveolar fricatives. 

b. *s] PW : No [s] before the right-edge of PW (i.e., word-finally) 
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Structure & Inputs 

Candidate Outputs 

*sC 

*h 

FAITHPL 

*ol 

S| PW 

* s lpw 

[morphinternal] PW 

/disc-o/ 

/visi-o/ 

[dis.co] 

*! 





"S’ [dih.co] 


x- 

x- 



"S’ [vi.sio] 






[vi.hio] 


*! 

*! 



[morph - suf] PW 

/dies-mo/ 

/mes-es/ 

[dies.mo] 

*! 





"S’ [dieh.mo] 


x- 

x- 



•s’ [me.seS] 


x- 

x- 



[me.heS] 


x-x- f 

x-x- 1 




Tableau 8. Isl ->[h] in all aspirating dialects. 


(9) c. *s| PW : No [s] before any PW edge (left or right) 

When *sC ranks above faithfulness and a constraint barring h, no matter how the 
other constraints are ranked, morpheme-internal and pre-suffixal /s/s are realized 
as [h] in dialects which aspirate. Using the same word and syllable structure for all 
varieties of Spanish, ranking the constraints in (io)b-c give the differences in dialects 
found in (8). The three rankings in Tableau 9 (overleaf) have been developed for three 
types of dialect by Wiltshire (in press). 

Thus syllabification remains the same in the different dialects, while the location of 
aspiration differs. The same holds for In/ velarization; In/ is realized in some dialects 
as [r]j in certain positions, including word and prefix finally, as in the Granada dialect 
(Hualde 1992) in (10). 


(10) PW internal: 

a. 

/fin+es/ 

[.fin.eh.] 

‘ends’ 


b. 

/enano/ 

[.e.na.no.] 

‘dwarf’ 

Across words 

c. 

/sin# fin/ 

[.siiyfiiy] 

‘without end’ 


d. 

/sin#eso/ 

[.si.pe.so.] 

‘without that’ 


e. 

/bien#esta/ 

[.bje.qeh.ta.] 

‘well-being’ 

Across prefixes 

f. 

/en+amora/ 

[.e.pa.mo.ra.] 

‘enamors’ 


g- 

/in+umano/ 

[.i.pu.ma.no.] 

‘inhuman’ 


While previous accounts have treated velarization as involving rules interspersed 
with resyllabification, the same kind of analysis as presented for the aspiration of Is/ 
can be applied in a parallel approach to these data, as illustrated in Tableau 10 (over¬ 
leaf). 

Again, syllabification is the same across dialects, but the realization of consonan¬ 
tal allophones can differ based on their position in word structure, using a parallel 
analysis in a single step. 
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Buenos Aires Argentinian 

Structure & Inputs 

Candidate Outputs 

*sC 

* s 1pw 

*h 

FAITHPL 

*cl 

S| PW 

[pre [morphl] PW ] PW 
/des+igual/ 

«*• [de.s[igual]] 




X- 


[de.h[igual]] 


*! 

*! 



[word] PW [word] PW 

/bicos/ /estranos/ 

■a? [bico.s] [ehtranos] 


x- 

X- 

X- 

X- 

[bico.h] [ehtranos] 


x-x-l 




Rio Negro and Caribbean II 

Structure & Inputs 

Candidate Outputs 

*sC 

* s 1pw 

*h 

FAITHPL 

x- c | 

S| PW 

[pre [morphl] PW ] PW 
/des+igual/ 

"*■ [de.s[igual]] 





x- 

[de.h[igual]] 



*! 

*! 


[word] PW [word] PW 

/bicos/ /estranos/ 

[bico.s] [ehtranos] 


*! 

x-x- 

x-x- 

X- 

■s’ [bico.h] [ehtranos] 



x-x-x- 

x-x-x- 


Caribbean I 

Structure & Inputs 

Candidate Outputs 

*sC 

* s 1pw 

*h 

FAITHPL 

*cl 

S| PW 

[pre [morphl] PW ] PW 
/des+igual/ 

[de.s[igual]] 



*j 



os' [de.h[igual]] 




X- 

x- 

[word] PW [word] PW 

/bicos/ /estranos/ 

[bico.s] [ehtranos] 


*! 

*! 

X-X- 

x-x- 

•s’ [bico.h] [ehtranos] 




x-x-x- 

x-x-x- 


Tableau 9. Variation in Is/ —>[h] before vowels. 


Structure & Inputs 

Candidate Outputs 

*n] PW 

*cl 

s Ipw 

*nC 

*i) 

FAITHPL 

[morphinternal] PW 

/fin+es/ ‘ends’ 

" 3 " [.fin.eh.] 






[.fi.qeh] 


x- 

x- 



[pre [morph] PW ] PW 
/en+amora/ ‘enamors’ 

[,e.n[a.mo.ra.]] 






•s’ [.e.ij[a.mo.ra.]] 




x- 

x- 

[word] PW [word] PW 

/sin/ /eso/ ‘without 
that’ 

[.si.n[e.so.]] 

*! 

*? 




•s' [,si.ij[e.so.]] 




x- 

x- 


Tableau 10. Ini —>[y] at prosodic boundaries. 


3. comparison and conclusions. Previous attempts within rule-based approaches 
and OT have proven to be more complex or less adequate in coverage. Rule-based 
approaches divide syllabification into stages, building the nucleus, simple onset, com¬ 
plex onset, and coda; these apply in order word-internally (Hualde 1992, Harris 1993). 
After words are put into phrases, the simple onset rule reapplies, but the complex 
onset rule does not. As Colina (1997) notes, this is an ad hoc restriction, unrelated 
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to the motivation of the onset constraint. Furthermore, for aspiration and velariza- 
tion, rule-based approaches must order syllabification both before and after aspira¬ 
tion (Kaisse 1999). Though this simplifies the environment for the rule of aspiration, 
such accounts suffer from unmotivated complexity, adding ordering which results in 
abstract syllabifications, and adding opacity, since surface [h] s appear in positions 
which would not trigger the rule (onset, not coda). 

In an OT approach, rather than language particular restrictions on the application 
of the complex onset rule, we call upon cross-linguistically motivated constraints to 
distinguish word-internal from cross-word syllabification, and we syllabify only once. 
In my account, the aspiration and velarization contexts are directly related to the sur¬ 
face forms. Previous accounts in OT (Kenstowicz 1996, Colina 1997, Face 1999) can 
also claim some of the same advantages: they avoid resyllabification and refer only to 
output. However, they generally assume that aspiration is tied to coda position and 
add constraints that compare one output form to another. Kenstowicz (1996) extends 
correspondence constraints from Input-Output and Base-Reduplicant relationships 
(McCarthy and Prince 1995) to Uniform Exponence. This constraint requires lexical 
items that do not occur independently to match their own output shape in other 
forms, as in (14). 

(14) uniform exponence (UE): minimize the differences in the realization of 

a lexical item 

Kenstowicz uses UE to compare the realization of the prefix /des/ before a vowel- 
initial word with its realization before a consonant-initial one. Since /des/ does not 
occur independently, the interpretation of the constraint is problematic; how does the 
prefix know to match its realization before a consonant-initial word, rather than vice 
versa? Furthermore, to avoid applying UE to pairs with suffixes ([meh], *[meheh] for 
‘month, months’), Kenstowicz claims that UE is defined for specific morphemes only, 
rather than for all prefixes or words. Colina (1997) attempts to improve this account 
by using a prosodic and a segmental version of UE and claims UE need not be speci¬ 
fied for prefixes. For dialect differences, however, she notes that UE might be reranked 
or use a different allomorph as a basis for comparison in different dialects, again 
revealing that there is no principled basis for the choice of allomorph on which the 
constraint is evaluated. 

Face (1999) avoids stipulating that UE as applies only to specific morphemes by 
analyzing prefixes as independent phonological domains (PDs), based on the obser¬ 
vation that prefixes act differently from inflectional suffixes in syllabification. He 
modifies UE to the constraint UE-PD (15), and claims that UE-PD forces prefix-final 
Is/ to match the form of the prefix in preconsonantal position, where aspiration is 
forced by some high ranking constraint. 

(15) UE-PD: minimize the differences in the realization of a phonological 

domain (Face 1999) 
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However, the PD analysis gives prefixes the same status as words, leaving their lack of 
stress to stipulation. Apart from problems with the term PD, the problem of dialect 
variation remains. Face’s analysis requires that in dialects such as Caribbean I, prefixes 
are analyzed as PDs so that prefix-final Is/ will aspirate. Equating prefixes and words 
predicts that aspiration word-finally is accompanied by aspiration prefix-finally. This 
is not the case in dialects such as Rio Negro and Caribbean II. Also, as Face explains 
both syllabification and aspiration by‘PDs’, changing their structure for different aspi¬ 
ration facts would also result in different syllabification facts. Such an account is 
therefore unable to simultaneously handle the similarities as well as the differences in 
different dialects. 

In my analysis, I have used independently motivated constraints on prosodic 
structures to account for both syllabification and dialect variation. There are several 
advantages to this account. I have captured the different locations of aspiration and 
velarization in different dialects with a uniform syllabification and uniform prosodic 
word structure for all varieties of Spanish. We do not need rule ordering and abstract 
syllabifications, which lead to opacity and which require derivational levels rather 
than a fully parallel analysis. Finally, the interpretation of prosodic constituent integ¬ 
rity as alignment allows for an OT account without questionable extensions to UE 
constraints. 


I thank the audience at lacus 2000 and anonymous reviewers for useful comments 
and Irene Moyna for additional examples from Spanish. I take full responsibility for any 
remaining errors of data or analysis. 

In the data that follows, ‘. ’ is used for syllable boundaries, “#’ for word boundaries, and “+’ 
for morpheme boundaries. Accent marks have been omitted throughout the paper, as the 
analysis here does not address stress. In (2)—(5), clusters or syllables of particular interest 
are underlined. For onset phonotactics, see Colina (1997). 

As a reviewer rightly points out, the use of the terms left and right are artificial when 
applied to spoken languages; the edges being referred to are the beginning and end of the 
unit (syllable, word, utterance). As ‘left’ and ‘right’ are entrenched in the theory, I use them 
here for compatibility with other work in OT. 

In the OT tableaux, is used to indicate a violation of the constraint above the column 
in which it appears. T indicates that a candidate output has lost because of that constraint 
violation. A violation of a constraint in a higher ranked column is worse than one in a 
lower ranked column, and multiple violations of the same constraint are worse than single 
violations; either can cause a candidate to lose to a more optimal candidate. 
Nunez-Cedeno and Morales-Front (1999) independently came to a similar description of 
these facts, though they only sketch the OT principles involved and do not develop the 
difference between prefixes and suffixes. 

The reader is referred to her work for details. Peperkamp claims only to show that mono¬ 
syllabic prefixes do not incorporate into the PW of the base, but not whether they are 
PWs themselves in Spanish. I argue that the difference among dialects in their treatment 
of prefix-final and word-final Is/ supports a recursive structure as correct for Spanish. 
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7 I diverge from Peperkamp’s analysis by allowing word and syllable boundaries to be mis¬ 
aligned, a violation of propernesting (Peperkamp 1997). See Wiltshire (1999) for argu¬ 
ments that this constraint should be violable. 

8 The constraint is a combination of two constraints, one aligning left edges of PWs and 
lexical words, and one aligning the right edges. The two need not be ranked independently 
here, so they have been combined, though I will indicate by the asterisk which edge is 
responsible for the violation. 

9 The prefixes ending in -s are bis-, des-, dis-, es-, ex-, minus-, plus-, pos- and tra(n)s-. I use 
des- here as representative. 

10 Although early accounts of Isl aspiration use preconsonantal position (Guitart 1979), 
recent accounts use coda position, following Harris (1983). Dialects such as BAA (Kaisse 
1998), in which pre-pausal Is/ is [s], result naturally from a preconsonantal description, 
but require an additional constraint if aspiration is coda-based. Though special con¬ 
straints for phrase-final position are possible, this option is not pursued here, pending 
further research. 
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